live attenuated bacteria of the species Aotinobacillus pleuropneumoniae. 

The present invention relates to live attenuated bacteria of the genus Actinobacillus 
pleuropneumoniae, having a mutation in a gene encoding a toxin, methods for the pro- 
duction of such bacteria, to vacdnes comprising such bacteria, methods for the production 
of such vaccines, to vaccines comprising a toxin, methods for the production of such vac- 
cines and methods for the protection of animals against infection with bacteria of the ge- 
nus Actinobacillus pleuropneumoniae. 

Bacteria belonging to the genus Actir^obacillus all produce so-called RTX-toxins. (RTX 
stands for repeat in toxin). 

It is the presence of these RTX-toxins that highly contributes to the pathogenic character 
of these bacteria. 

The RTX-toxins have been extensively reviewed by Braun et al. (Critical Rev. in Microbiol. 
18(2): 115-158 (1991)). RTX-toxins in Gram-negative strains have also been reviewed in 
Welch, R.A. (Molecular Microbiology 5/3: 521-528 (1991)) and in Welch et al. (Inf. Agents 
and Disease 4: 254-272 (1995)). 

All known RTX-toxins display some kind of cytotoxic or cytolytic activity. The target-cell- 
and host-spedficity differ however, depending on the toxin and on differences in acylation 
(McWhinney et al.; J. Bact. 174: 291-297 (1992) and Hackett et al.; J. Biol. Chem. 270: 
20250-20253 (1995)) . As a result of the difference in target cells, the various toxins of the 
RTX-toxin family are known e.g. as haemolysin, cytolysin or cytotoxin. 
The genus Actinobacillus comprises a number of different species, inter alia, Actinobacil- 
lus pleuropneumoniae, A, actinomycetemcomitans, A. suis, A. rossii, A. equuli and A. lig- 
nieresii. 

Actinobacillus pleuropneumoniae produces serotype-dependent RTX-toxins that are cyto- 
toxic/cytolytic to pig, horse, bovine and human erythrocytes, to rabbit and porcine neutro- 
phils and to porcine alveolar macrophages. (Rosendal et al; Am. J. Vet. Res. 49: 1053- 
1058 (1988), Maudsley J.R, and Kadis S; Can. J. Microbiol. 32: 801-805 (1986), Frey. J 
and Nicolet. J; Inf. & Imm. 56:2570-2575 (1988), Bendixon et al; Inf. & Imm. 33: 673-676 
(1981), Kamp, E.M. and van Leengoed. LA.M.G.; J. Clin. Microbiol. 27: 1187-1191 
(1989)). 

Infections with Actinobacillus in pigs are the cause of severe economic losses to pig indus- 
try, due to acute mortality in young pigs and reduced weight gain in older animals. 
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The genetic organisation of the operons involved in the synthesis, activation and transpor- 
tation of the RTX toxins in Gram-negative bacteria has been reviewed recently by Coote. 
J.G. (FEMS Microbiology reviews 88: 137-162 (1992)). Frey has specifically reviewed the 
known three RTX-toxins in Actinobacillus pleumpneumoniae in Bacterial Protein Toxins, 
Zbl Bakt. Suppl. 24. p. 322-. Freer et al. (Eds.). Gustaf Fischer. Stutttgart. Jena. New York. 
1994. 

In Actinobacillus pleumpneumoniae. this kind of RTX-operon contains four genes: the ac- 
tual Toxin-gene (A), an Activator-gene (C). and two genes (B and D) encoding proteins 
involved in secretion of the toxin into the surrounding medium. The primary translation- 
product of the Toxin-gene (A) is a non-toxic protein, of which the toxic activity is activated 
by the Activator-gene (C) product 

Until recently, it was assumed that only three RTX-toxins. all having the above-described 
genetic organisation or at least having the Toxin-gene (A) and Activator-gene (C) . existed 

in Actinobacillus species. 

These three RTX-toxins. Apxl, Apx-ll and Apx-lll have respectively a pronounced haemo- 
lytic activity (Apxl). a mild haemolytic activity (Apx-ll) or a macrophage-cytotoxic activity 
(Apx-lll). 

The various toxic activities are fairiy randomly divided over the serotypes. There are four 
subgroups: 

a subgroup A. represented by serotypes 1. 5. 9 and 11. producing Apxl and Apx-ll. 
a subgroup B. represented by serotypes 2. 3. 4, 6 and 8. producing Apx-ll and 
Apx-lll. 

a subgroup C. represented by serotype 10, producing Apxl only, 

a subgroup D. represented by serotype 7 and 12. producing Apx-ll only. 

It is known, that Apxl, -II. and -III all are essential elements in universal vaccines against 
Actinobacillus pleumpneumoniae infection: a vaccine not comprising at least Apxl. -II. and 
-III will not provide protection against all Actinobacillus pleumpneumoniae serotypes. Also, 
a vaccine not comprising at least the Apx-toxins of one specific serotype vy^ll not even in- 
duce protection against that single serotype. 
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Subunit vaccines based on in vitro synthesised RTX-toxins from A. pleuropneumoniae that 
lost their toxicity have been described earlier, e.g. in European Patent EP No. 0.354.628, 
in which subunit vaccines based upon a haemolysin and a cytotoxin of A. pleuropneumo- 
niae are disclosed, and in European Patent EP No 0.453.024, in which A, pleuropneumo- 
niae subunit vaccines based upon haemolysins. cytotoxins and outer membrane proteins 
are disclosed. 

There are however four important disadvantages to subunit vaccines in general: 

• high amounts of antigenic material are needed in order to adequately trigger the 
immune system. 

• usually, only B-cell immunity is triggered. 

• several protective antigens are only triggered in vivo, and therefore can not be present 
in subunit vaccines. 

• a live pathogenic bacterium has many important immunogenic molecules, such as 
Outer Membrane Proteins and capsular polysaccharides, all potentially important for 
protection and thus to be included in an efficient subunit vaccine. 

Next to the obvious problems mentioned under points one and two, especially the fourth 
point makes it difficult to make an efficient subunit vaccine. 

This is e.g. illustrated by the A. pleuropneumoniae subunit vaccine disclosed in European 
Patent EP No 0.453.024 mentioned above, in which four different subunits (three RTX- 
toxins and an outer membrane protein) are combined in one vaccine. 

It is clear, that in order to overcome the disadvantages of subunit vaccines against Pas- 
teureffaceae-infection, a live attenuated vaccine would be highly desirable. 
A live attenuated vaccine has the following advantages: 

it can be administered in low doses (it is self-replicating) 

it closely mimics the natural/wild-type infection 

it provides all the possible immunologically important antigens at the same time. 
Nevertheless. In spite of the clear advantages, no live vaccines based on Actinobacillus 
pleuropneumoniae were commercially available prior to the present invention. 
The reason for this lies in the following paradox: as mentioned before. ApxI. -II. and -III all 
are essential elements of universal vaccines against Actinobacillus pleuropneumoniae in- 
fection. Live vaccines therefore have to produce these three RTX-toxins. These three 



RTX-toxins are however strong virulence factors in all Actinobacillus species (see e.g. 
Coote, J.G.; FEMS Microbiology reviews 88: 137-162 (1992). Tascon et al.; Mol. Microbiol. 
14: 207-216 (1994)), Jansen et al.; Inf. & Imm. 63: 27-37 (1995)). 
Deletion of the RTX-toxins in order to attenuate the virulence of live App strains is techni- 
cally feasible, but this does not provide a solution for the dilemma: such RTX-negative 
strains would be useless as live attenuated vaccine strains since they do no longer induce 
immunity in the host against the haemolytic/cytotoxic activity of Actinobapillus 
pleuropneumoniae field strains. 

Vimlence factors that, although important in the induction of immunity, do play a less im- 
portant role in building up immunity than Apxl, -II and -III, and thus can in principle be de- 
leted are however currently not known. 

It would thus be highly desirable to have a site on the genome of App that attributes to 
vimlence and therefore leads to an attenuated App strain when modified, whereas at the 
same time it is. although useful in triggering immunity, dispensable from a vaccine point of 
view. No such sites are however cun-ently known. Moreover, it would be highly desirable if 
such a site would be universally present in ail App strains, instead of being restricted to 
certain serotypes. Such a site would then allow all different serotypes to be attenuated by 
deletion of that same site. 

It is one of the objectives of the present invention to provide such an attenuation site, uni- 
versally present in ail Actinobacillus pleuropneumoniae strains regardless their serotype. 

Recently, a new gene was found in a serotype 1 strain of Actinobacillus pleumpneumo- 
niae (Thesis T.J. Anderson Nov. 1995). 

Although this gene does not resemble the known Actinobacillus Apxl, -II and -III genes, it 
bears resemblance to RTX-toxin genes known from bacteria belonging to Neisseria men- 
ingitidis, for which reason it was named RTX-gene apxIV, The gene however differs in al- 
most all aspects from the three known RTX-toxin genes apxl, -11 and -/// present in the 
various species of the Actinobacillus family as described above. First of all, the genomic 
organisation is completely different. Secondly, there is no activator-mechanism as is found 
for the known Apx-toxins. In the third place, no specific in vivo haemolytic or cytotoxic ac- 
tivity could at that time be attributed to the gene, or it's possible gene product. 
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It was now surprisingly found that this gene, fully in contrast with the three known RTX- 
genes. is present in all bacteria of the species Actinobacillus pleumpneumoniae, regard- 
less their serotype. This was detemiined by hybridisation of a probe comprising apxIV 
coding sequences with restriction fragments of the DNA from Actinobacillus pleumpneu- 
moniae of all serotypes as described in Example 6 and 7. 

Unexpectedly it was found now. that apxIV deletion mutants are viable, but they behave 
less vinjient compared to their apx/V-possessing parent strains. 

Therefore, it was determined that the gene product, the ApxIV toxin is a virulence factor in 
all Actinobacillus pleumpneumoniae strains. This is an unexpected conclusion, since up 
until now, no effects at al. let alone effects possibly influencing viailence had been attrib- 
uted to the gene product in vivo. In fact, up until now there was not even proof that the 
gene was expressed in Actinobacillus pleumpneumoniae in vivo or in vitm anyway. 
It therefore is one of the merits of the invention that it was found that: 

• the apxIV gene Is present in all A. pleumpneumoniae strains regardless the serotype, 

• the apxIV gene product is a virulence factor in all A. pleumpneumoniae serotypes, 

• A. pleumpneumoniae strains with a deletion in the apxN gene are still viable but have 
a decreased vimlence without significantly irripairing the immunogenic properties of the 
strains. 

Therefore, the invention provides for the first time live attenuated bacteria of the spedes 
Actinobacillus pleumpneumoniae, that do not produce a functional ApxIV toxin. 

A functional ApxIV toxin is considered to be a protein that has all the characteristics of the 
ApxIV toxin as expressed in a wild-type bacterium, and is expressed at the wild-type level. 
Therefore, a non-functional ApxIV toxin is considered to be a toxin that lacks some or all 
of the characteristics of the ApxIV toxin as expressed in a wild-type bacterium, and/or is 
expressed at a level, insufficient to obtain wild-type effects of ApxIV toxin. 

The inability to produce the ApxIV toxin can e.g. be due to modificatfons In the coding se- 
quence encoding the ApxIV toxin. It may also be e.g. the result of modifications in regions 
known to be involved in transcription of the apx/V gene, such as the promotor region, or of 
modifications in regions involved in translation, such as the ribosome binding site. 
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The overall stmcture of the apx/\/ gene is given in figure 1. 

In this figure, the direct repeat regions, characteristic for ApxIV toxin are indicated by 
dashed boxes, whereas the also ApxIV-specific glycine-rich nonapeptide regions are indi- 
cated by black anx)ws. The repeats are found at the C-tenminal part of ApxIV, These char- 
acteristic features are present in all Actinobacillus pleumpneumoniae serotypes. The nu- 
cleic acid sequence and amino acid sequence of two serotypes are represented in SEQ. 
ID. No. 1-4. SEQ. ID. NO. 1 shows the nucleic acid sequence of the apxIV gene of App 
serotype 1 , and SEQ. ID. NO. 2 shows the matching amino acid sequence of the serotype 
1 ApxIV toxin. SEQ. ID. NO. 3 shows the nucleotide sequence of the apxIV gene of App 
serotype 3, whereas SEQ. ID. NO. 4 shows the matching ami.no acid sequence of the se- 
rotype 3 ApxIV toxin. Figure 2 shows the strikingly high level of conservation at amino acid 
level, especially in the N-terminal 650 amino adds, between the Apx-toxins of the various 
Actinobadllus pleumpneumoniae serotypes. This is also a remari<able characteristic of the 
apx/V genes. It is clear from figure 1, that a variation In the number of repeats at the C- 
terminal part of the toxin may occur, depending on the serotype. This variation accounts 
for the difference in size of the genes and encoded toxins obtained from the various sero- 
types. 

There may be some variation in nucleic add sequence even between apxIV genes iso- 
lated from different isolates of Actinobacillus pleumpneumoniae, belonging to the same 
serotype. This is due to natural variation well known in ttie art to exist in all organisms. It is 
possible that some amino adds in the ApxIV toxin encoded by Uie apx/\^ gene are re- 
placed by others in the ApxIV toxin of another serotype, while ttie polypeptide is not al- 
tered in its function. For instance, a polypeptide containing Asp at a certain site, and its 
variant containing Asn at the comparable site still have the same properties. This process 
in which an amino acid is replaced by an functionally analogous amino-add is called func- 
tional displacement. In this case the variant proteins are called functional variants. 
Another cause of variation is the phenomenon of degeneracy of the genetic code. Shortly, 
it means, that e.g. the amino acid glutamic add is coded for by both GAT and GAA. This 
phenomenon holds for all amino acids, except Met and Trp. Thus, it is obvious, ttiat e.g. 
the ApxIV toxin of serotype 1, as given in the present invention can not only.be coded for 
by the nudeotide sequence given in SEQ ID NO: 1 but also by a very large variety of otiier 
sequences, still all giving the same or functionally the same polypeptides. 



Therefore, a variant apxlV sequence encoding a polypeptide that is functionally compara- 
ble to the ApxlV toxin falls within the scope of the present invention. 

Live attenuated bacteria according to the invention can be obtained in several ways. 
One possible way of obtaining such bacteria is by means of classical methods such as the 
treatment of wild-type Actinobacillus pleumpneumoniae bacteria with mutagenic agents 
such as base analogues, treatment with ultraviolet light or temperature treatment. 
Strains that do not produce a functional ApxlV toxin do not or to a lesser extend induce 
anti-ApxIV toxin antibodies, and therefore can easily be selected in animal tests. 
The necessary antisemm can be obtained as described below in Example 3, 
Another possibility is to deliberately introduce, using recombinant DNA-technology, a well- 
defined mutation in the gene encoding the ApxlV toxin. Such a mutation may be an inser- 
tion, a deletion, a replacement of one nucleotide by another one or a combination thereof, 
with the only proviso that the mutated gene no longer encodes a functional ApxlV toxin. 
It can easily be seen, that especially mutations introducing a stop-codon in the open 
reading frame, or mutations causing a frame-shift in the open reading frame are very suit- 
able to obtain a strain which no longer encodes a functional ApxlV toxin. Such mutations 
can e.g. be made by means of in vitro site directed mutagenesis using the Transfonner® 
kit sold by Clontech. Many other standard recombinant DNA techniques such as digestion 
of the gene with a restriction enzyme, followed by endonuclease treatment and religation, 
are equally applicable. 

Therefore, in a preferred form, this embodiment of the invention relates to live attenuated 
bacteria in which the gene encoding the ApxlV toxin comprises a mutation. 

Well-defined mutations involving the deletion of fragments of the apx/V gene or even the 
whole gene, or the insertion of heterologous DNA-fragments, when compared to classi- 
cally induced mutations, have the advantage that they will not revert to the wild-type situa- 
tion. 

Thus, in a more preferred form, this embodiment of the invention refers to live attenuated 
bacteria in which the gene encoding the ApxlV toxin comprises an insertion and/or a dele- 
tion. 
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Given the large amount of vaccines given nowadays to pigs, it is clear that combined ad- 
ministration of several vaccines would be desirable, if only for reasons of decreased vac- 
cination costs. It is therefore very attractive to use live attenuated vaccine strains as a re- 
combinant carrier for heterologous genes, encoding antigens selected from other patho- 
genic micro-organisms or vimses. Administration of such a recombinant canier has the 
advantage that after administration of such a canier. immunity is induced against two or 
more diseases at the same time. The live attenuated bacteria according to the present in- 
vention provide a very suitable earner for heterologous genes, since the gene encoding 
the ApxIV toxin can be used as an insertion site for such heterologous genes. The use of 
the apxIV gene as an Insertion site has the advantage that at the same time the apxIV 
gene Is inactivated, and the newly introduced heterologous gene can be expressed in ac- 
cordance with the homologous Actinobacillus pleumpneumoniae genes. The construction 
of such recombinant carriers can be done routinely, using standard molecular biology 
techniques such as homologous recombination. Therefore, in an even more preferred em- 
bodiment, the present invention relates to live attenuated bacteria of the species Actino- 
badllus pleumpneumoniae that do not produce a functional ApxIV toxin, and in which 
there Is a heterologous gene inserted in the apxIVgene. Such a heterologous gene can. 
as mentioned above, e.g. be a gene encoding an antigen selected from other pathogenic 
micro-organisms or viaises. Another possibility Is to insert a gene encoding a protein In- 
volved in triggering the immune system, such as an interieukine or an interferone. 

In a still even more preferred fomi of the invention, the heterologous gene encodes one or 
more antigens selected from the group consisting of Porcine Reproductive Respiratory 
Syndrome (PRRS) vims. Pseudorabies viais. Porcine Influenza virus, Porcine Parvovirus, 
Transmissible Gastroenteritis virus, rotavirus, Escherichia coli. Erysipelothrix rhusio- 
pathiae, Pasteunella multocida, Bondetella bmnchiseptica, Haemophilus parasuis and 
Streptococcus suis. 

There is however a serious pitfall in expression of heterologous genes in recombinant car- 
riers: it is known that several proteins are toxic if they are expressed in heterologous bac- 
teria. Therefore, genes encoding such proteins can never be introduced in heterologous 
earners, since successful recombinants will eventually die as a result of the expression a 
certain amount of the heterologous gene. The P2-protein of Haemophilus influenzae, to 



9 



name just one example, can simply not be expressed in E. coli. (Munson et al.. Infect. & 
Immun. 57: 88-94 (1989)). It is one of the objectives of the present invention to offer a re- 
combinant live carrier that does not have this drawback. It vi^as unexpectedly found, that 
although the apxIV gene is efficiently expressed in vivo (see Example 5). it is not ex- 
pressed in vitm (see Example 4). This was concluded from the failure to show the pres- 
ence of ApxIV toxin in in vitro grown A. pleuropneumoniae cultures. This means that ApxIV 
expression is switched on or off. depending on the environment in which A. pleuropneu- 
moniae is grown. This feature offers an unexpected advantage over known live recombi- 
nant carriers: if the expression of the heterologous gene is brought under the control of 
the apx/V promoter, the live attenuated P. pleuropneumoniae carrier according to the In- 
vention can be grown in vitro to high densities, regardless the inserted heterologous gene, 
since the foreign gene will not be expressed under these conditions. After administering a 
number of bacteria to the host, the expression of the heterologous gene will start and at 
some time during replication or after the death of the bacterium it will become available to 
the immune system of the host. The heterologous gene to be expressed can be function- 
ally linked to the apxIV promoter by e.g. replacing the coding sequence of the apxIV gene 
by the coding region of the heterologous gene. It is not necessary to replace the whole 
apxIV gene: it suffices to replace the ATG-codon of ApxIV by the coding region of the het- 
erologous gene including its stop-codon. It is also possible to express a heterologous 
gene under the influence of the apx/V promoter by making a fusion construct. This can be 
made by inserting the heterologous gene in frame with the apx/V reading frame down- 
stream of the apxIV ATG codon. 

The wording "functionally linked to the apxIV promoter means that transcription of the 
heterologous gene starts at the apxIV promoter. 

It goes without saying that each location of the inserted heterologous gene in which it is 
functionally linked to the apx/V promoter falls within the scope of the invention. 
Therefore, the most preferred fom of this embodiment relates to live attenuated bacteria 
according to the present invention, carrying a heterologous gene that is functionally linked 
to the promoter region of the apxIV gene. 

The surprising finding that the native apxIV promoter is a switchable promoter that is 
switched off in vitm and switched on in vivo makes this promoter a very versatile expres- 
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sion tool both in it's natural host and as a heterologous promotor in other bacteria. When 
used as a heterologous promotor in other bacteria, the DNA comprising the promotor can 
be isolated from its host and transferred to a bacterium other than Actinobacillus 
pleurvpneumoniae. Another option that has now become feasible, is the cloning of several 
copies of the apxIV promotor each controlling the expression of another gene. This can be 
done in the host bacterium Actinobacillus pleuropneumoniae, but this principle of multiple 
copies is equally applicable to other bacteria. As mentioned above, the promotor can be 
used for the selective in vivo expression of one or more heterologous genes encoding an- 
tigens selected from other pathogenic micro-organisms or viruses. The promoter can also 
be used for the expression of a heterologous DNA sequence encoding a cytokine such as 
an interieukin, Tumor Necrosis Factor or an interferon. Several cytokines, e.g. interferons 
are known to play an important role as Immune modulators. Thus it may be advantageous 
to express such genetic information under the control of the apxIV promotor. 
Therefore, another embodiment of the Invention relates to a nucleotide sequence harbour- 
ing the promotor controling the expression of the apxIVgene. 

The switchable promotor that in the native situation controls the expression of the apxIV 
gene, was now found to be located in the DNA fragment between position 451 and 1132 
of SEQ ID NO: 5. It is clear, that those parts of this DNA fragment that are not essential 
promotor elements need not necessarily be present in the fragment Thus, shorter frag- 
ments of this DNA fragment in wrtiich the promotor activity is retained, are equally suitable 
for the expression of heterologous genes. Therefore, a more preferred fomi of this em- 
bodiment relates to a nucleotide sequence comprising the DNA fragment from position 
451 tol 132 of SEQ ID NO: 5 or a subfragment thereof still having promotor activity. 

Bacterial promoters all share two consensus regions, the so-called -10 and the -35 region. 
Although the flanking sequence of these consensus regions may to a certain extend influ- 
ence the efficiency of the promotor. it can be advantageous to use only that part of the 
promotor region that comprises the DNA fragment between -35 and the ATG codon. This 
DNA fragment is located between position 617 and position 641 of SEQ ID NO: 5. 
Therefore, in a more preferred fomi of this embodiment the invention relates to a nucleo- 
tide sequence comprising the DNA fragment from position 617 to 641 of SEQ ID NO: 5. 



The present invention also relates to ApxIV toxin as a subunit vaccine component. 
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Subunit vaccines will most probably comprise the three known Apx-toxins. This was men- 
tioned above. Since it was unexpectedly found, that the ApxIV toxin is however present in 
all A. pleumpneumoniae serotypes as mentioned above, it is a desirable additional com- 
ponent of subunit vaccines: neutralising antibodies raised against the ApxIV toxin provide 
protection against the ApxIV toxin produced by each and every Actinobadllus 
pleumpneumoniae strain, regardless the serotype. Therefore, another embodiment of the 
invention relates to subunit vaccines for the protection of animals against infection with a 
bacterium of the species Actinobadllus pleumpneumoniae, that comprise purified ApxIV 
toxin. The ApxIV toxin can be administered alone, or in combination with any or all of the 
toxins Apxl, -!! and -III mentioned above and/or e.g. in combination with Outer Membrane 
Proteins (OMRs) of Actinobadllus pleumpneumoniae. Such vaccines can easily be pre- 
pared by admixing ApxIV toxin in an amount sufficient to induce an immune response, and 
a pharmaceutically acceptable earner. Production of the ApxIV toxin is possible by intro- 
ducing-the apxIV gene in a suitable expression vector, expression of the gene and isola- 
tion of the toxin. Many versatile expression systems are known in the art, such as bacterial 
expression systems, baculovims expression systems and mammalian cell expression 
systems. In Example 3 it is described how to obtain the ApxIV toxin by expression of the 
gene in E. coll. 

Still another embodiment of the invention relates to live attenuated vaccines comprising 
live attenuated bacteria as described above for the protection of animals against infection 
with a bacterium of the species Actinobadllus pleumpneumoniae. Such vaccines can be 
obtained by admixing live attenuated bacteria with a phannaceutically acceptable earner. 
These vaccines comprise at least an immunogenically effective amount of the live attenu- 
ated producing bacterium according to the invention. Immunogenically effective means 
that the amount of live attenuated bacterium administered at the moment of vaccination is 
sufficient to induce in the host an effective immune response to vimlent forms of the RTX- 
toxin producing bacterium. The useful dosage to be administered will vary depending on 
the age, weight and animal vaccinated, the mode of administration and the type of patho- 
gen against which vaccination is sought The vaccine may comprise any dose of bacteria, 
sufficient to evoke an immune response. Doses ranging between 10' and 10" bacteria 
are e.g. very suitable doses. 
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The pharmaceutically acceptable carrier may be as simple as water, but it may e.g. also 
comprise culture fluid in which the bacteria were cultured. Another suitable canier is e.g. a 
solution of physiological salt concentration. Other examples of pharmaceutically accept- 
able carriers or diluents useful in the present invention include stabilisers such as SPGA. 
carbohydrates (e.g. sorbitol, mannitol. starch, sucrose, glucose, dextran), proteins such as 
albumin or casein, protein containing agents such as bovine serum or skimmed milk and 
buffers (e.g. phosphate buffer). 

Optionally, one or more compounds having adjuvant acUvity may be added to the vaccine. 
Adjuvantia are non-specific stimulators of the immune system. They enhance the immune 
response of the host to the invading pathogen. Examples of adjuvantia known In the art 
are Freunds Complete and Incomplete adjuvans, vitamin E, non-ionic block polymers, mu- 
ramyldipeptides. ISCOMs (immune stimulating complexes, cf. for Instance European Pat- 
ent EP- 109942). Saponins, mineral oil. vegetable oil. and Carbopol (a homopolymer). 
Adjuvantia. specially suitable for mucosal application are e.g. the E. coll heat-labile toxin 
(LT) or Cholera toxin (CT). 

Other suitable adjuvants are for example aluminium hydroxide, phosphate or oxide, oil- 
emulsions (e.g. of Bayol F (R) or Marcol 62 (R). saponins or vitamin-E solubilisate. 

Therefore, in a preferred fomn. the vaccines according to the present invention comprise 
an adjuvant. 

For administration to animals, the vaccine according to the presentation can be given inter 
alia intranasally, intrademially, subcutaneously, by aerosol or intramusculariy. 

There are several ways to store both subunits and live organisms. Storage in a refrigerator 
is e.g. a well-known method. Also often used is storage at -70°C in a buffer containing 
glycerol. Bacteria can also be kept in liquid nitrogen. Freeze-drying is another way of con- 
servation. Freeze-dried bacteria can be stored and kept viable for many years. Storage 
temperatures for freeze-dried bacteria may well be above zero degrees, without being 
detrimental to the viability. Freeze-drying is equally applicable for subunits. 
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Freeze-drying can be done according to all well-known standard freeze-drying procedures. 
Optional beneficial additives, such as e.g. skimmed milk, trehalose, gelatin or bovine se- 
aim albumin can be added in the freeze-drying process. 

Therefore, in a more preferred embodiment, the vaccine according to the present inven- 
tion is in a freeze-dried fomi. 

In an even more prefen-ed fomi of this embodiment, the vaccine additionally comprises 
one or more antigens selected from other pathogenic micro-organisms or viruses. 
Such a vaccine can be obtained by adding one or more antigens selected from other 
pathogenic bacteria or viruses to the live attenuated bacterium according to the Invention 
and a phamfiaceutically acceptable carrier as described above. 
Of course, it is possible to add not only one or more antigens, but also one or more of the 
whole pathogens as such, in an inactivated or live form. 

It can alternatively be obtained by cloning the genetic information encoding one or more 
antigens selected from other pathogenic micro-organisms or viruses into the live attenu- 
ated bacterium, using known recombinant DNA technology as described above. 
Such vaccines are of course less stressing for the animal to be vaccinated than separate 
vaccinations with each of the pathogens, both from a medical and a physical point of view. 

In a still even more prefen-ed form, these antigens are selected from the group consisting 
of Porcine Reproductive Respiratory Syndrome (PRRS) vims. Pseudo-rabies vims, Por- 
cine Influenza vims. Porcine Parvovims, Transmissible Gastroenteritis vims, rotavims, Es- 
cherichia coli, Erysipelothrix rfiusiopathiae, Pasteurella multocida, Bordetella bronchisep- 
tica, Haemophilus parasuis and Streptococcus suis. 

The invention also relates to methods for the preparation of a live attenuated bacterium of 
the species Actinobacillus pleuropneumoniae that is not capable of producing a functional 
ApxIV toxin. These method comprise the introduction of a mutation in the gene encoding 
the apxIV protein. Both classical mutation techniques, using mutagenic agents, and re- 
combinant DNA techniques well-known in the art for insertion, replacement or deletion of 
genetic information from the apxIV gene are applicable. 
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In a preferred form, the above mentioned methods are used for the introduction of a dele- 
tion. 

Methods for the preparation of a live attenuated vaccine according to the invention, that 
comprise admixing bacteria according to the invention with a pharmaceutically acceptable 
carrier are also part of the invention. 

Also falling within the scope of the invention are methods for the preparation of a subunit 
vaccine. Such methods comprise the mixing of purified ApxIV toxin with a pharmaceuti- 
cally acceptable carrier. 

Another generally acknowledged problem in the field of vaccination with live vacdnes is 
the following: the presence of antibodies against a certain pathogen in the serum of a host 
animal-indicates that the host has been infected with the pathogen, either in a vimlent or 
attenuated fonm. It is however impossible to discriminate between field-infected animals 
and animals vaccinated with a live vacdne strain. The live attenuated Actinobadllus 
pleurvpneumoniae according to the present invention offers a solution to this problem as 
follows: 

As described in Example 3, the apxIVgene of Actinobadllus pleurvpneumoniae serotype 
1 has been Isolated and expressed in a heterologous host cell. This expression product 
was subjected to PAAGE-geletectroforesis and then used for Western-blotting. The blots 
were incubated with convalescent serum obtained from a deliberately Actinobadllus 
pleuropneumoniae-infected pigs, and sera from field-strains. It was found, that the apxIV 
gene Is expressed in vivo in all Actinobadllus pleurvpneumoniae field strains tested. This 
implicates, that pigs infected with Actinobadllus pleurvpneumoniae will always have anti- 
bodies against the strain with which they were infected, regardless the serotype of the in- 
fectious strain. 

The live attenuated bacteria according to the present invention can, due to the deletion of 
the apxIVgene, no longer make ApxIV toxin. Therefore animals vacdnated with a live at- 
tenuated Adinobacillus pleurvpneumoniae strain according to the invention will not have 
antibodies against ApxIV toxin in their semm. 

In a comparative test, e.g. an ELISA test, such sera will therefore react with all immuno- 
genic Actinobadllus pleurvpneumoniae-proteins such as e.g. Apxl, II and/or III, but not 
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with ApxIV. Sera from pigs infected with an Actinobacillus pleuropneumoniae field strain 
however will react with all inrimunogenic Acf/nofeac///us p/eu/opneiimon/ae-proteins. includ- 
ing ApxIV. Therefore, the live attenuated Actinobacillus pleuropneumoniae according to 
the present invention turns out to be a very suitable marker vaccine, i.e. a vaccine strain 
that can be discriminated from a field strain. 

A diagnostic test for the discrimination between vaccine strains and field strains can be a 
simple ELISA-test in which purified ApxIV toxin is coated to the wall of the wells of an 
ELISA-plate. Incubation with serum from pigs to be tested, followed by e.g. incubation with 
a labelled anti-pig antibody can then reveal the presence or absence of antibodies against 

ApxIV toxin. 

Another example of a diagnostic test system is e.g. the incubation of a Western blot com- 
prising purified ApxIV toxin with semm of pigs to be tested, followed by detection of spe- 
cific anti-ApxIV antibodies. 

Therefore, diagnostic test for the discrimination between sera from pigs infected with Acti- 
nobacillus pleuropneumoniae field strains and from pigs vaccinated with a vaccine com- 
prising live attenuated vaccine Actinobacillus pleuropneumoniae strains according to the 
invention, that comprise purified ApxIV toxin, also fall within the scope of the invention. 

Still another problem seen in pig health care is the following: It is difficult to detenmine in a 
both quick and unambiguous manner if a pig is infected with Actinobacillus pleuropneu- 
moniae or A, su/s. or possibly a combination of both. Diagnostic tests for the specific de- 
tection of A. suis are currently not available. This is mainly due to the fact that A. 
pleuropneumoniae and A. suis share so many antigens. As an example may serve, that 
two highly antigenic Apx-toxins; Apxl and Apxil have highly conserved homologues in e.g. 
A. suis (Van Ostaayen et al., submitted for publication). 

The known RTX-genes, encoding the Apxl. -II and -III toxins or homologues are found in 
practically all members of the genus Actinobacillus, such as A. pleuropneumoniae, A. suis, 
A. rossii and A. equuli. Thus. It was initially assumed by the inventors, that the new RTX- 
toxin ApxIV would also be common to all members of the genus Actinobacillus, 
It was however found after testing a the swine-pathogenic Actinobacillus, again surpris- 
ingly in contrast with the known three RTX-genes, that this novel RTX-gene apxIV ls only 
present in the swine-pathogen Actinobacillus pleuropneumoniae. It is absent in all other 
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common swine pathogenic Actinobacillus species, and therefore it is also absent in Acti- 
nobadllus suis. See Example 6 and 7. 

Therefore, it was surprisingly noticed that the presence of antibodies against ApxIV in the 
semm of a pig Is a quick and unambiguous proof that the pig has been Infected with A. 
pleuropneumoniae, and not with A. suis or any other swine-pathogen Actinobacillus spe- 
cies. 

Thus the present invention also provides a diagnostic test based on the presence or ab- 
sence of antibodies against ApxIV. and therefore a discriminating test for specifically dis- 
tinguishing an infection with A. pleuropneumoniae from an infection with A. suis 
Such a test can e.g. be an ELISA test that comprises in separate wells the Apxl and -11 
toxins, present in both A. pleuropneumoniae and A. suis and the purified ApxIV toxin. Se- 
mm from A. suls-infected animals will react only with the wells comprising the Apxl and -II 
whereas A. pleuropneumoniae-mfeded animals will also react with the well comprising the 
purified ApxIV toxin. 

Example 1: 

Cloning and analysis of the apxIVaene of A. pleuropneum oniae serotype 1. 
Standard molecular biological procedures (plasmid DNA isolation, restriction digestion, 
agarose gel electrophoresis. Southern blotting, ligation, transformation, electroporation) 
were, unless stated othen«nse, essentially perfomned as described in Sambrook et al. 
(Molecular Cloning. A Laboratory Manual, Cold Spring Harbor. N.Y., 1989) or Ausubel et 
al., (Current Protocols In Molecular Biology. John Wiley & Sons. N.Y., 1987). PCR was 
perfonned essentially as described in Innis et al., (PCR protocols, A guide to Methods and 
Applications, Academic Press Inc., San Diego, 1990). Chromosomal DNA isolation was 
perfonned according to Pitcher et al., (Lett. Appl. Microbiol., 8;151-156. 1989). The origin 
of all A. pleuropneumoniae reference strains (serotype 1: strain 4074; serotype 2: strain 
SI 536: serotype 3: S1421; serotype 4 M62; serotype 5a: K17; serotype 5b: L20; serotype 
6: fema; serotype 7: WF83: serotype 8: 405; serotype 9: CVI13261; serotype 10: 13039; 
serotype 11: 56153 and serotype 12: 8329) is described by Frey and Nicolet, (J. Clin. Mi- 
crobiol., 28;232-236, 1990). A. pleuropneumoniae serotype 3 strain HV1t4 is a field iso- 
late (j.e. one of the serotype 3 strains tested in Beck et al., J. Clin. Microbiol., 32;2749- 
2754, 1994). Other >^cf»rJobac///i/s strains used; A. rossii: ATCC 27072; A. equuli: ATCC 
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19392; A. suis: ATCC 15558. Pasteurella haemolytica type 1 strain ATCC 14003 was 
used. 

E. CO// host strains used: XLI-blue (Stratagene. La Yolla. Ca; genotype: mcAl endAI 
gyrA96 thi-1 hsdR17 supE44 relAI lac [F proAB lacfZDM15 Tn10 (Tef)]) and 
HMS174(DE3) (AlVIS Biotechnology Ltd. Switzerland; genotype: F mcA {ckiz. mnn^) rif 
IDE3). 

On the basis of the preliminary sequence data obtained from the thesis of T.J. Anderson 
(University of Guelph. 1 995i. two primers, designated APXIVA-1 L (5'- 
TGGCACTGACGGTGATGA-3*) and APXIVA-1 R (5'-GGCCATCGACTCAActAT-3'). were 
synthesised. These primers were used in a PCR amplification, with chromosomal DNA 
from A. pleumpneumoniae serotype 3 strain HV114 and serotype 1 reference strain 4074 
as a template. With both strains a fragment of 442 bp was amplified. The fragment derived 
from the serotype 3 chromosomal DNA was labelled with Digoxigenin-11-dUTP 
(Boehringer Mannheim) according to the protocol of the manufacturer (this fragment was 
designated probe APXIVA. see fig. 4). The labelled probe was subsequently used to hy- 
bridize a Southern blot containing C/al digested chromosomal DNA from strain 4074. The 
probe hybridised with a fragment of approximately 8.0 kb. The apxIV gene from serotype 
1 strain 4074 was isolated by ligating C/al digested chromosomal DNA into C/al digested 
pBluescript II SIC (Stratagene USA). E. co// strain XLI-blue was transfomfied with the 
ligated DNA and transformants were selected on an LB plate with 100 mg/ml of ampidllin. 
Clones harbouring the apx/V were selected by colony hybridisation of a nitrocellulose rep- 
lica of the plate with the APXIVA probe. Thus, a plasmid designated pROK7 was isolated 
and shown to harbour a C/al insert of approximately 8 kb. The first 6736 bp of the aal in- 
sert were sequenced (SEQID 1) and an open reading frame of 4971 nucleotides was 
identified encoding a protein of 1657 amino acid residues (SEQID 2) vwth a predicted size 
of approximately 186 kD. The gene was designated apxlV_var1 (see fig. 3). 

Example 2: 

Cloning and analysis of the aox/V oene o f A. oleuropneumoniae serotype 3. 
The labelled probe APXIVA (mentioned in example 1) was used to hybridize a Southern 
blot containing C/al digested chromosomal DNA from sti^in HV114. The probe hybridised 
wiUi a fragment of approximately 7.0 kb. The isolated chromosomal DNA from HV114 was 
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digested with C/al. and ligated with C/al digested pBluescript II SIC (Stratagene USA). E. 
coli strain XL1-blue was transformed with the ligated DNA and transfonnants were se- 
lected on an LB plate with 100 mg/ml of ampidllin. Clones harbouring the apx/V were se- 
lected by colony hybridisation of a nitrocellulose replica of the plate with the APXIVA 
probe. Thus, a plasmid designated pR0K5 was isolated and shown to harbour a aal In- 
sert of approximately 7 kb. The insert was analysed by sequence analysis (SEQID 3). An 
open reading frame of 4146 bp was identified encoding a protein of 1382 amino acid resi- 
dues (SEQID 4). with a predicted size of approximately 154 kD. The gene was designated 
apxlV_var3 (see fig. 3) . 



Example 3: 

EXPRESSION OF ApxIV var3-polvhistidine fusio n proteins in E. coli 
From plasmid pR0K5. a deletion done was made which contains the 3' end of the apxIV 
gene, starting at the SamHI site (nudeotide No. 2747 in SEQ ID No: 3) up to the C/al site 
at the end of the insert downstream of the apxIV gene. This plasmid was designated 
pROKI. Using oligonudeotides APXIVAHISI-L (6- 
AGCCATATGGGCGATTTAAATTTCAG-3') and APXIVAHIS1-R (5'- 
TATGGATCCTCCGTGCTTliTGAGC-3') and DNA from plasmid pROK1 as a template, a 
DNA fragment of 2.1 kb was amplified (see fig. 4) containing the region from bp 3520 to 
5643 in apxlV_var3 (SEQID 3) flanked with Nde] and SamHI restriction sites at the 5' and 
3' end respectively. After doning of the Wcfel/BamHI digested PGR fragment in expression 
vector pETHIS-1. digested with the same enzymes, a plasmid designated 
pJFFapxlV6/10hls-1 was obtained. Plasmid pETHIS-1 is a derivative of pET14b (Novagen 
Inc., Madison. Wi.) where the multiple doning site has been extended and a region encod- 
ing a histidine decamer has been inserted. Consequently. The pJFFapxlV6/10his-1 plas- 
mid contains a translational fusion encoding a histidine hexamer, followed by amino add 
residues 653 up to 1360 from SEQID 4, followed by a histidine decamer, under the control 
of a T7 promoter. The plasmid was transfen-ed to E. coli strain HMS174(DE3) with pLysS, 
which contains an IPTG indudble T7 RNA polymerase gene as well as the T7 lysozyme 
gene for increased stability. The strain was grown in LB medium containing 25 mg/ml of 
chloramphenicol and 100 mg/ml of ampidllin, up to an ODsso of 0.5, and induced with iso- 
propyl-b-D-thiogalactopyranoside at a concentration of 10 mM. After the addition of IPTG. 
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the cells were incubated at ZTC for 2.5 hours, the cells were harvested by centrifugation. 
and fusion protein with the expected size of 80 kD was isolated in the form of inclusion 
bodies. The inclusion bodies were solubilized in a solution of 6M guanidine hydrochloride. 
300 mM NaCI and 50 mM NaH2P04 at pH 8.0 and the 80 kD fusion protein was further 
purified by Immobilised Metal Affinity Chromatography (IMAC) (Schmitt et al., Molecular 
Biology Reports 18;223-230. 1993) using Ni^* chelated columns (Qiagen AG, Basel). Pure 
protein was eluted from the column at pH 5,0. Pooled fractions were dialysed against a 
solution of 300 mM NaCI and 50 mM NaH2P04 at pH 8.0. A rabbit was immunised with 500 
mg of the polyhistidine fusion product, mixed with 1 volume of Complete Freunds Adjuvant 
(Difco Labs. Detroit, Ml). A booster dose of the same amount, mixed with incomplete Fre- 
und Adjuvant was given 3 weeks later. Four weeks after the booster, the rabbit was bled 
and a hyperimmune semm comprising anti-ApxIV toxin antibodies . designated serum 
522-409. was obtained. 

Example 4: 

Expression of apxIV genes in in wfro grown A. pleurvpneumoniae 
The A. pleurvpneumoniae reference strain from serotype 1 was grown in Columbia broth 
supplemented with 10 mg/ml of b-NAD and harvested as described (Beck et al., J. Clin. 
Microbiol.. 32;2749-2754, 1994). Adjacent to lanes comprising ApxIA, ApxIlA and ApxIVA- 
polyhistidine fusion proteins the concentrated culture supernatant was separated by poly- 
acrylamide gel electrophoresis (Laemmli, Nature 227:680-685, 1970) and subjected to a 
Westem blotting procedure (Towbin et al., Proc. Natl. Acad. Sci. USA 76:4350-4354, 
1979). The Westem blot was reacted with anti-ApxIA- and anti-ApxIlA monoclonal antibod- 
ies as described by Beck et al., (J. Clin. Microbiol., 32;2749-2754, 1994), and with anti- 
ApxIV semm 522-409 (see example 3). The isolated RTX toxin fraction of serotype 1 
clearly contains ApxIA and ApxIIA. The presence of ApxIVA could not be demonstrated 
(see fig. 5). 

Example 5: 

Expression of apxIV genes in A, pleuropneumoniae in vivo during infection 
A polyacrylamide gel containing the 80 kD polyhistidine-ApxlV_var3 fusion protein (see 
example 3) was transferred to a nitrocellulose membrane. The membrane was divided into 
strips which were reacted with (100-fold dilutions of) convalescent field sera against sero- 
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type 1 or sera from a pig, experimentally infected with the serotype 1 reference strain 
(Frey and Nicolet. Vet. Microbiol., 28;61-73, 1991). The reaction was visualised using al- 
kaline phosphatase-labelled conjugate against rabbit IgG (Kirkegaard Perry Inc.. Gaithers- 
burg, Md.) and NBT (4-Nitrobluetetrazolium chloride) and BCIP (5-Bromo-4-chloro-3- 
indolyl-phosphate) colour development (see fig. 6). The serotype 1 field sera, as well as 
semm from the experimentally infected pig react with the 80 kD polyhistidine-ApxlV_var3 
protein. This indicates that the ApxIV protein actually is expressed, is antigenic and in- 
duces anti-ApxIV toxin antibodies during A. pleumpneumoniae infection in pigs. 

Example 6: 

Presence of aoxlV oenes in all A. pleumpneumoniae serotypes and the absence thereof 
in non-pleuropneumoniae Acf/notectf/us-strains using Southern blotting 
To investigate the presence of the apxIV gene in the various A.pleumpneumoniae sero- 
types and related bacteria, three probes were made (see fig. 4). Probe APXIVA is de- 
scribed in example 1. Probe APXIVA2 contains the 2015 bp DNA fragment between the 
fiamHI and Nm\ sites. The 758 bp probe APPIVA1 was made by PGR amplification with 
oligos APPIV1-L (5'-GGGACAGTGGCTCAATT/Sg-3') and (APPIV1-R (5'- 
AGCTGTAAACTCCACCAACgS'). All probes were labelled with Digoxigenin-11-dUTP 
(Boehringer Mannheim) according to the protocol of the manufacturer and hybridised with 
Southern blots containing C/al digested chromosomal DNA of all A, pleumpneumoniae 
reference strains and the HV1 14 field strain, Actinobacillus suis (ATCC 15558), Actinoba- 
cillus mssii (ATCC 27072) and Actinobacillus equuli (ATCC 19392). All three probes react 
similariy (see fig. 7 for the results with the APXIVA2 probe). All A. pleumpneumoniae 
strains react, whereas no hybridisation is obsen^ed with the A. suis, A. equuli and A. mssii 
strains. 

Example 7: 

Presence of aox/V genes in A. pleumpneumoniae and related strains using PGR amplifi- 
cation 

With 50 ng of chromosomal DNA from the various A. pleumpneumoniae serotypes, other 
Actinobacillus species andP. haemolytica as templates, and primers APXIVA-1L (5- 



TGGCACTGACGGTGAtGA-3') and APXIVA-1R (ff-GGCCATCGACTCAACCAT-S') PCR 
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amplification was performed. After analysis of the products on an agarose gel, products 
with the expected size of 442 bp were observed in all A. pleumpneumoniae samples, but 
In none of the o\her Actinobacillus species (fig. 8). This indicates that in addition to the re- 
sults in example 6. also PGR could be used to discriminate A. pleumpneumoniae from 
other Actinobacillus species. 

Example 8: 

Overexpression of ApxIV-varl polvhistidine fusion protein. 
Starting with plasmid pROK-7 (see example 1) as a template and oligonupleobdes 
APX4/II5-L (5'-CGCCATA^ACAAAATTAACTATGCa!&) and APX4II6-R (5'-CGCGAAT 
TCAGCGACACAAGAGATAT) as PCR-primers, a PGR fragment was amplified. A suffi- 
cient amount of this fragment was then digested with restriction enzymes Ndel and EcoRI 
and cloned in exprBssion vector pETHIS-1, digested with the same enzymes as described 
in Example 3. From the resulting plasmid. designated pJFFApxIVAIHisl. a 206 kD poly- 
histidine fusion protein (MW determined in PAGE) of 1841 amino add residues was over- 
expressed in E. coll as described In Example 3. The protein Is encoded in the coding re- 
gion spanning nucleic acid no. 1 132 to 6546 as depicted in SEQ ID NO: 5. The amino acid 
sequence of the protein in given in SEQ ID NO: 6. In Western blot this product was shown 
to react with specific antl-ApxIV semm 522-409 (antiserum described in example 3). 

Example 9: 

Protection of mice bv vaccination with AdxIV ag ainst APP challenge. 
The 206 kD polyhistidine-ApxIV fusion protein as described in Example 8 and a compara- 
ble 108 kD polyhistidine-ApxIA-fusion protein, both derived from serotype 1 reference 
strain 4074 genomic material, were overexpressed as described in example 3. The cell 
pellet of Induced E. coli cells was resuspended into PBS buffer (1 g. cell pellet In 6 ml 
buffer) and sonicated on ice for two times 46 seconds on ice for lysis of the cells. After 
centrifugation for 20 minutes at 22.000 x g at 4''C, the supematant was discarded and the 
resulting pellet was washed with a solution of 3M urea (pH 6.3). The urea was removed 
after centrifugation for 20 minutes at 22.000 x g, and the resulting pellet was solubllized in 
6M GuanidiniumHGI (pH 8.0). The protein samples were standardised by specific protein 
content after densitometry of PAGE gels. The samples were diluted with PBS and fomiu- 
lated with an oil adjuvant. 
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Three groups of 15 mice each, were intraperitoneally immunised with the ApxIV antigen 
(36.3 microgram). ApxIA antigen (36.3 microgram), or the adjuvant alone. The vaccines 
were administered in a volume of 0.4 ml. Twenty-four days after the first vaccination, each 
group of mice was split in two groups of 7 and 8 mice which were boosted with half the 
amount of antigen. The groups of 7 mice were troosled intraperitoneally in a volume of 0.2 
ml and the groups of 8 mice were vaccinated intramusculariy with 0.1 ml in each hind leg. 
Thirteen days after the booster, the mice were challenged intraperitoneally vinth 1.5 lO" cfu 
of a viailent serotype 1 strain . 



Example 10: 

Poreforminq capacity of AdxIV: 

Freshly Induced E. coli cells expressing ApxIV were used as the source of protein for 
testing porefomiation in artificial lipid bilayers as described by Maier et al.. Infect. Immun.. 
64; 4415-4423(1996). The methods used for black lipid bilayer experiments have been 
described previously (Benz et al.; Biochim. Biophys. Acta 511: 305-319 (1978)). Mem- 
branes were formed from a 1 % solution of asolectin (soybean ledtin type IV-S from 
Sigma, St. Louis MO) in n-decane. Zero current membrane potentials were measured with 
a Keithley 610 C electrometer 5-10 min. after a 10-fold salt gradient was established 
across the membranes (Benz et al.; Biochim. Biophys. Acta 551: 238-247 (1979)). The 
presence of ApxIV resulted in a high frequency of pore fonnation in the presence of 0.5 % 
cholesterol, with an average single channel conductance (G) of 4 nS. 
These results indicate that the ApxIV Induces pores Into artificial bilayers and is therefore 
toxic to eukaryotic cells and thus is a virulence factor for A. pleuropneumoniae. 
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Legend to the figures: 

Figure 1: Comparison of ApxIVAvar! and ApxlVAvar3 (var stands for serotype). Graphic 
representation of the different features found in the C-terminal end. Dashed boxes repre- 
sent the direct repeat regions in ApxIVA. Bold vertical bars indicate the position of glycine 
rich nonapeptides. and DNA polymerase farmly B signatures are indicated by black trian- 
gles. The amino acid sequence YSDSANSKK represents the spacer sequence in the 
ApxlVAvar3 gene. The sequence segment of ApxIVAvarl which is deleted in ApxlVAvar3 
is also indicated in the figure. 

figure 2: Alignment of the amino add sequences of direct repeat 1 (A), direct repeal 2 (B) 
and direct repeat 3 (C) of ApxIVAvarl and ApxIVAvarS. The copies of each of the direct 
repeats 1, 2 and 3 are labeled by letters to distinguish them for the sequence comparison. 
Variant residues are shown in bold letters. 

Figure 3: Partial restriction maps of pBLac7 (Thesis of T.J. Anderson University of 
Guelph, 1995), pR0K7 and pR0K5. The different open reading frames (ORF's) are indi- 
cated by an-ows. The interrupted anx)w of lacZ in pR0K7 indicates partial sequencing of 
the gene. Potential rfio-independent transcription terminators are indicated (W). Potential 
transcription start sites are indicated by a triangle. Restriction sites: A= Asp7(X); B= 
eamHI; C= C/al; E= EcoRV; Ec= EcoRI; H= H/ndlll; N= Nm\] Nd= /Vdel; Nh= Nhe\; P= Psfl; 
S= Spel; Sm= Smal. 

Figure 4: Location of the various oligonucleotides and probes on the map of the 
apxIVAvarS gene. 

Figure 5: Expression of ApxIV in in vitro cultivated serotype 1 reference strain 4074. 
Panel A was reacted with the anti-ApxIA monoclonal antibody, panel B with anti-ApxIlA 
monoclonal antibody and panel C was reacted with anti-ApxIVA seaim 522-409. Lane 1 
contains ApxIA-polyhistidine fusion protein, lane 2 contains ApxIIA-polyhistidine fusion 
protein, lane 3 contains strain 4074 concentrated culture supernatant, lane 4 contains 
ApxIVA polyhistidine fusion protein. 
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Figure 6: Immunoblot showing the reactivities of sera from pigs which were experi- 
mentally infected with the reference strain of serotype 1 (lane 1) or pig sera from serotype 
1 field infections (lane 2-4) with the 80 kD polyhistidine-ApxIV fusion protein. As a positive 
rontrol (+). serum 522-409 was used, as a negative control (-). polyclonal rabbit semm 
against Apxl and Apxil (Frey et al.. Infect. Immun.. 57:2050-2056. 1989) was used as a 
1000-fold dilution. 

Figure 7: Southern blot of C/al digested genomic DNA hybridised with probe 
APXIVA2. Lanes 1-13: A. pleuropneumoniae reference strains; 1: serotype 1; 2: serotype 
2; 3. serotype 3; 4: serotype 4; 5: serotype 5a: 6: serotype 5b; 7: serotype 6; 8: serotype 
7 9: serotype 8; 10: serotype 9; 11: serotype 10; 12: serotype 11; 13: serotype 12; 14: 
HV1 14 field strain: 15: A. suis (ATCC 15558); 16: A. rossii (ATCC 27072); 17: A. equuli 
(ATCC 19392). Molecular size markers are indicated (in kilobasepalrs) on the left 

Figure 8: PCR amplification of apxIV using primers APXIVA-1 L and APXIVA-1 R. 
Lane assignments: lanes 1 to 13 contain the A. pleuropneumoniae reference strains from 
serotypes 1. 2. 3. 4. 5a, 5b. 6, 7. 8. 9. 10. 11 and 12 respectively; lane 14: strain HV114: 
lane 15: A. su/s ATCC 15558; lane 16: A. ross// ATCC 27072; lane 17: A. equuli ATCC 
19392; lane 18: A. lignieresii ATCC 49236; lane 19: P. haemolytica type 1 ATCC 14003. 
Molecular size maimers (in kilobasepalrs) are indicated on the left. 
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SEQUENCE LISTING 



d) GENERAL INFORMATION: 

(i) APPLICANT: 

(A) NAME: AKZO Nobel N.V. 

(B) STREET: Velperweg 76 

(C) CITY: Arnhem 

(E) COUNTRY: The Netherlands 

(F) POSTAL CODE (ZIP) : 6824 BM 

(ii) TITLE OF INVENTION: Live attenuated Actinobacillus 
pi europneumoni ae 

(iii) NUMBER OF SEQUENCES: 4 

(iv) COMPUTER^ READABLE FORM: 

(A) r^DIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: P(;- DOS /MS -DOS 

(D) SOFTWARE: Patentin Release #1.0, Version #1.30 (BPO) 



(2) INFORMATION FOR SEQ ID NO: 1: 

(11 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6736 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI- SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Actinobacillus pleuropneumoniae 

(B) STRAIN: 4074 (serotype 1 reference strain) 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: pROK7 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION:1576 . .6549 

(D) OTHER INFORMATION :/codon_start« 1576 
/functions "RTX toxin" 
/products " ApxIV_varl " 
/gene= "apxIV_varl " 
/numbers 1 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 

ATCGATATGC CGCCGGGTAC GGGCGATATC CAACTTACTC TTTCGCAACA AATTCCGGTT 60 

ACCGGTGCGG TGGTGGTAAC CACTCCGCAA GATATTGCGT TATTAGATGC GGTGAAAGGT 120 

ATTTCAATGT TCCAAAAAGT GTCGGTACCG GTCTTAGGTA TCATTGAAAA TATGAGCGTA 180 

CATATCTGCC AAAATTGCGG TCACCACGAA GATATTTTCG GCACCGGCGG TGCG6AGAAA 240 

GTGGCGAAGA AATACGGTAC TAAAGTATTA GGACAAATGC CGTTGCATAT TCGCTTACGT 300 
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GCCTATATTG AATTAGCGGC AAAAGTCGCT TCGGAATTAT ACTGGCAAGG TTCGGTTATC 420 

CCGTCTGAAA TTATGATTCG TGAAGTAAAA TAAGTTTTAA TAACCACGAA AACACAAAGA 480 

ACACAAGCGG TAGAATTTGC AGAAAAATTT GCAAATCCTA CCGCTTTTTT ATTAGTACGA 540 

TTCGCTGTTG GACTGCTATT TGATTTGGTT TGTCAGGATA TTATGTTATT GTAATGAAAT 600 

GTTAGTGAAT TATTTTTATT AATTTGAAAG GAAACAAAAT GAAAATAAAA AAACGTTACA 660 

TTGCGCTGTT GGTCTTAGGT GTCGTTATCA GCTATGCCTG GTATCAAAAT TATCAATX5GG 720 

AACAGCTGAT GTTAAGCX^GT TATTGTGAAA AGGACGGAAG TTATTTTGAT GATAGGCATA 780 

CGAAGCAAGA ACTGATTGAT AGGGCAATTA ACTATATGCT GGAGCATCAA TCTAAAAAAA 840 

CATACGATGC TTATACTGAT GAACCTTTAG AAATAAAACC ATATTTAACA ATAGAGGAAT 900 

TTAAAhAACT CAATCCAAAT TGTTGTGAAA TTACCTCATG GCCAGCAGAT GCAGTTCCAC 960 

AAGATTGGGA TGTTCGTGTG GAAGGTAAGG CATATAGGTA TGTAATCGTA AAATATTTAA 1020 

GAACCTTAGC AAATAGAGAA CCTGAACGAT GGGAAACTAG TATTGTTTTT GATAATTGCG 1080 

GCAATCCTAA AAGAGCAAGC TACTTATATT ATTTAAAGAG AGAAATTTAT TATGACAAAA 1140 

TTAACTATGC AAGATGTGAC CAATTTATAT TTATATAAAA CX3AAAACTCT ACCTAAAOAT 1200 

AGATTGGATG ATTCACTTAT TTCTGAAATA GGAAAAGGAG ATGATGATAT TGATAGAAAA 1260 

GAATTTATGG TGGGGCCGGG ACGTTTTGTG ACCGCTGATA ACTTTAGCX3T TGTAAGAGAT 1320 

TTTTTTAATG CTGGGAAATC ACGCATTATT GCGCCGCAAG TCCCGCCTAT TCGTTCACAQ 1380 

CAGGAAAAAA TCTTGGTCGG TTTAAAACCG GGCAAATATT CCAAAGCGCA GATATTCGAA 1440 

ATGCTGGGTT ATACGAAAGG CGGAGAAGTG GTAAATGGGA TGTTTGCCGG TGAAGTCCAG 1500 

ACATTAGGCT TTTATGACGA TGGCAAAGGG GATTTACTCG AACGCGCCTA TATCTGGAAT 1560 

ACCACAGGAT TTAAA ATG AGC GAC AAT GCC TTT TTT GTT ATA GAA GAA TCA 1611 
Met Ser Asp Asn Ala Phe Phe Val lie Glu Glu Ser 
1 5 10 

GGC AAA CGC TAT ATT GAA AAC TTT GGT ATT GAA CCT CTT GGT AAG CAA 1659 
Gly Lys Arg Tyr lie Glu Asn Phe Gly lie Glu Pro Leu Gly Lys Gin 
15 20 25 

GAA GAT TTT GAT TTT GTC GGC GGC TTT TGG TCT AAC TTA GTG AAT CGT 1707 
Glu Asp Phe Asp Phe Val Gly Gly Phe Trp Ser Asn Leu Val Asn Arg 
30 35 40 



GGT TTG GAA AGT ATT ATC GAC CCA TCC GGT ATC GGT GGA ACG GTA AAC 1755 
Gly Leu Glu Ser lie lie Asp Pro Ser Gly lie Gly Gly Thr Val Asn 
45 50 55 60 

CTT AAC TTT ACC GGC GAG GTG GAA ACC TAG ACG TTA GAC GAA ACA AGG 1803 
Leu Asn Phe Thr Gly Glu Val Glu Thr Tyr Thr Leu Asp Glu Thr Arg 
65 70 75 . 

TTT AAA GCG GAA GCG GCG AAG AAA AGC CAT TGG AGT TTA GTG AAT GCG 1851 
Phe Lys Ala Glu Ala Ala Lys Lys Ser His Trp Ser Leu Val Asn Ala 
80 85 90 

GCG AAA GTA TAC GGC GGT TTA GAC CAA ATT ATT AAA AAA CTA TGG GAC 1899 
Ala Lys Val Tyr Gly Gly Leu Asp Gin lie lie Lys Lys Leu Trp Asp 
95 100 105 
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AGT GGC TCA ATT AAG CAT TTA TAT CAA GAT AAA GAT ACG GGC AAA TTA 1947 
Ser Gly Ser He Lys His Leu Tyr Gin Asp Lys Asp Thr Gly Lys Leu 
110 115 120 

AAA CCG ATT ATT TAC GGC ACG GCC GGC AAC GAC AGT AAG ATT GAA GGC 1995 
Lys Pro He He Tyr Gly Thr Ala Gly Asn Asp Ser Lys He Glu Gly 
125 130 135 140 

ACT AAA ATC ACC CGT AGG ATT GCG GGT AAA GAA GTT ACG CTT GAT ATT 2043 
Thr Lys He Thr Arg Arg He Ala Gly Lys Glu Val Thr Leu Asp He 
145 150 155 

GCC AAT CAG AAA ATT GAA AAA GGC GTG TTA GAG AAA TTG GGG CTG TCT 2091 
Ala Asn Gin Lys He Glu Lys Gly Val Leu Glu Lys Leu Gly Leu Ser 
160 165 170 

GTT AGT GGT TCG GAT ATC ATT AAA TTG TTG TTT GGA GCA TTG ACT CCA 2139 
Val Ser Gly Ser Asp He He Lys Leu Leu Phe Gly AJLa Leu Thr Pro 
175 180 185 

ACT TTA AAT AGA ATG TTG CTA TCA CAA CTT ATC CAG TCT TTT TCC GAT 2187 
Thr Leu Asn Arg Met Leu Leu Ser Gin Leu He Gin Ser Phe Ser Asp 
190 195 200 

AGC TTG GCT AAA CTT GAT AAT CCC TTA GCC CCT TAC ACT AAA AAT GGC 2235 
Ser Le^i Ala Lys Leu Asp Asn Pro Leu Ala Pro Tyr Thr Lys Asn Gly 
205 ' 210 215 220 

GTG GIT TAT GTC ACC GGC AAA GGG AAT GAT GTG CTT AAA GGA ACT GAA 2283 
Val Val Tyr Val Thr Gly Lys Gly Asn Asp Val Leu Lys Gly Thr Glu 
225 230 235 

CAT GAG OAT TTC TTT CTC GGT GGT GAG GGG AAT GAT ACT TAT TAT GCG 2331 
His Glu Asp Leu Phe Leu Gly Gly Glu Gly Asn Asp Thr Tyr Tyr Ala 
240 245 250 

AGA GTA GGC GAT ACA ATT GAA GAC GCC GAC GGC AAA GGT AAA GTC TAT 2379 
Ara Val Gly Asp Thr He Glu Asp Ala Asp Gly Lys Gly Lys Val Tyr 
^ 255 260 265 

TTT GTG AGA GAA AAA GGG GTA CCT AAG GCG GAT CCT AAG CGG GTA GAG 2427 
Phe Val Arg Glu Lys Gly Val Pro Lys Ala Asp Pro Lys Arg Val Glu 

270 275 280 

TTT AGC GAG TAC ATA ACG AAA GAA GAA ATA AAA GAG GTT GAA AAG GGG 2475 
Phe Ser Glu Tyr He Thr Lys Glu Glu He Lys Glu Val Glu Lys Gly 
285 290 295 300 

TTA TTA ACT TAC GCA GTT TTA GAA AAT TAT AAT TGG GAA GAG AAA ACG 2523 
Leu Leu Thr Tyr Ala Val Leu Glu Asn Tyr Asn Trp Glu Glu Lys Thr 
305 310 315 

GCG ACT TTC GCT CAT GCG ACT ATG CTT AAT GAG CTT TTT ACT GAT TAT 2571 
Ala Thr Phe Ala His Ala Thr Met Leu Asn Glu Leu Phe Thr Asp Tyr 
320 325 330 

ACT AAT TAT CGT TAT GAA GTT AAA GGA CTA AAA TTG CCC GCC GTT AAA 2619 
Thr Asn Tyr Arg Tyr Glu Val Lys Gly Leu Lys Leu Pro Ala Val . Lys 
335 340 345 

AAG TTA AAA AGT CCG TTG GTG GAG TTT ACA GCT GAT TTA TTA ACT GTT 2667 
Lys Leu Lys Ser Pro Leu Val Glu Phe Thr Ala Asp Leu Leu Thr Val 
350 355 360 

ACG CCT ATT GAC GAA AAC GGA AAA GCA CTT AGC GAA AAA AGT ATT ACG 2715 
Thr Pro He Asp Glu Asn Gly Lys Ala Leu Ser Glu Lys Ser He Thr 
365 370 375 380 
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GTT AAA AAT TTT AAA AAT GGT GAT TTA GGA ATA AGG TTG TTG GAT OCT 2763 
Val Lys Asn Phe Lys Asn Gly Asp Leu Gly He Arg Leu Leu Asp Pro 
385 390 

AAT AGC TAT TAT TAT TTC CTT GAA GGC CAA GAT ACG GGT TTT TAT GGT 2811 
Asn Ser Tyr Tyr Tyr Phe Leu Glu Gly Gin Asp Thr Gly Phe Tvr Glv 
400 405 410 

CCT GCT TTT TAT ATT GAA CGA AAA AAC GGT GGC GGC GCT AAA AAT AAC 2859 
Pro Ala Phe Tyr He Glu Arg Lys Asn Gly Gly Gly Ala Lys Asn Asn 
415 420 425 

TCG TCG GGA GCA GGA AAT AGC AAA GAT TGG GGC GGG AAC GGG CAT GGA 2907 
Ser Ser Gly Ala Gly Asn Ser Lys Asp Trp Gly Gly Asn Gly His Glv 
430 435 440 

AAT CAC CGA AAT AAT GCC TCC GAC CTG AAT AAA CCG GAC GGA AAT AAT 2955 
Asn His Arg Asn Asn Ala Ser Asp Leu Asn Lys Pro Asp Gly Asn Asn 
445 450 455 460 

GGG AAT AAC CAA AAT AAC GGA AGC AAT CAA GAT AAT CAT AGC GAT GTG 3003 
Gly Asn Asn Gin Asn Asn Gly Ser Asn Gin Asp Asn His Ser Asp Val 
465 470 475 

AAT GCG CCA AAT AAC CCG GGA CGT AAC TAT GAT ATT TAC GAT CCT TTA 3051 
Asn Ala Pro Asn Asn Pro Gly Arg Asn Tyr Asp He Tyr Asp Pro Leu 
480 485 490 

GCT TTA GAT TTA GAT GGA GAT GGG CTT GAA ACC GTG TCG ATG AAC GGG 3099 
Ala Leu Asp Leu Asp Gly Asp Gly Leu Glu Thr Val Ser Met Asn Gly 
495 500 505 

CGA CAA GGC GCG TTA TTC GAT CAT GAA GGA AAA GOT ATT CGT ACC GCA 3147 
Arg Gin Gly Ala Leu Phe Asp His Glu Gly Lys Gly He Arg Thr Ala 
510 515 520 

ACG GGC TGG CTC GCT GCG GAT GAC GGT TTT TTA GTG TTA GAT CGT AAC 3195 
Thr Gly Trp Leu Ala Ala Asp Asp Gly Phe Leu Val Leu Asp Arg Asn 
525 530 535 540 

CAA GAC GGC ATT ATT AAT GAT ATA AGC GAG TTA TTT AGT AAT AAA AAT 3243 
Gin Asp Gly He He Asn Asp He Ser Glu Leu Phe Ser Asn Lys Asn 
545 550 555 

CAA CTT TCC GAC GGC AGT ATT TCT GCA CAC GGT TTT GCG ACA TTA GCC 3291 
Gin Leu Ser Asp Gly Ser He Ser Ala His Gly Phe Ala Thr Leu Ala 
560 565 570 

GAT TTG GAT ACA AAC CAA GAT CAG CGT ATC GAC CAA AAT GAT AAG CTG 3339 
Asp Leu Asp Thr Asn Gin Asp Gin Arg He Asp Gin Asn Asp Lys Leu 
575 580 585 

TTT TCT AAA CTC CAA ATT TGG CGG GAT TTA AAT CAA AAC GGT TTT AGT 3387 
Phe Ser Lys Leu Gin He Trp Arg Asp Leu Asn Gin Asn Gly Phe Ser 
590 595 600 

GAA GCG AAT GAG CTG TTT AGC TTA GAA AGT TTG AAT ATT AAA TCT TTA 3435 
Glu Ala Asn Glu Leu Phe Ser Leu Glu Ser Leu Asn He Lys Ser Leu 
605 610 615 620 

CAT ACC GCC TAT GAA GAG CGT AAT GAT TTT CTA GCG GGC AAT AAT ATC 3483 
His Thr Ala Tyr Glu Glu Arg Asn Asp Phe Leu Ala Gly Asn Asn He 
625 630 635 

CTT GCT CAG CTT GGG AAG TAT GAA AAA ACG GAC GGT ACT TTT GCA CAA 3531 
Leu Ala Gin Leu Gly Lys Tyr Glu Lys Thr Asp Gly Thr Phe Ala Gin 
640 645 650 
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ATG GGC GAT TTA AAT TTC AGT TTT AAC CCG TTT TAT AGC CGA TTT ACC 
Met Gly Asp Leu Asn Phe Ser Phe Asn Pro Phe Tyr Ser Arg Phe Thr 
655 660 665 

GAA GCG TTA AAT TTA ACC GAG CAA CAA CGT CGC ACA ATT AAT CTA ACC 
Glu Ala Leu Asn Leu Thr Glu Gin Gin Arg Arg Thr He Asn Leu Thr 
670 675 680 



3579 
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GGC ACC GGT CGG GTT CGG GAT TTG CGT GAA GCC GCC GCA CTT TCT GAG 3675 
Gly Thr Gly Arg Val Arg Asp Leu Arg Glu Ala Ala Ala Leu Ser Glu 
685 690 695 700 

GAG TTG GCT GCT TTA TTA CAA CAG TAC ACT AAG GCC TCC GAT TTT CAG 3723 
Glu Leu Ala Ala Leu Leu Gin Gin Tyr Thr Lys Ala Ser Asp Phe Gin 
705 710 715 

GCA CAA CGA GAA TTA TTG CCT GCC ATT TTA GAT AAA TGG GCG GCA ACG 3771 
Ala Gin Arg Glu Leu Leu Pro Ala He Leu Asp Lye Trp Ala Ala Thr 
720 725 730 

GAT TTA CAG TAT CAA CAT TAT GAT AAA ACA TTA CTT AAA ACG GTA GAA 3819 
Asp Leu Gin Tyr Gin His Tyr Asp Lys Thr Leu Leu Lys Thr Val Glu 
735 740 745 

AGT ACC GAT AGT AGT GCT TCT GTC GTT AGA GTC ACG CCT TCT CAA TTA 3867 
Ser Thr Asp Ser Ser Ma Ser Val Val Arg Val Thr Pro Ser Gin Leu 
750' 755 760 

AGT AGT ATA CGC AAT GCA AAG CAT GAT CCT ACC GTT ATG CAA AAC TTT 3915 
Ser Ser He Arg Asn Ala Lys His Asp Pro Thr Val Met Gin Asn Phe 
765 770 775 780 

GAA CAG AGT AAG GCA AAA ATT GCG ACT TTA AAT TCG CTC TAC GGG TTA 3963 
Glu Gin Ser Lys Ala Lys He Ala Thr Leu Asn Ser Leu Tyr Gly Leu 
785 790 795 

AAT ATC GAT CAA CTT TAT TAC ACG ACG GAT AAA GAC ATT CGC TAT ATT 4011 
Asn He Asp Gin Leu Tyr Tyr Thr Thr Asp Lys Asp He Arg Tyr He 
800 805 810 

ACT GAT AAA GTG AAT AAT ATG TAT CAA ACA ACC GTA GAA CTT GCC TAC 4059 
Thr Asp Lys Val Asn Asn Met Tyr Gin Thr Thr Val Glu Leu Ala Tyr 

815 820 825 

CGT TCT TTA CTT TTA CAA ACG CGT TTG AAG AAA TAT GTT TAT AGC GTT 4107 
Arg Ser Leu Leu Leu Gin Thr Arg Leu Lys Lys Tyr Val Tyr Ser Val 
830 835 840 

AAT GCG AAA CAA TTC GAA GGG AAA TGG GTA ACC GAT TAT TCT CGT ACT 4155 
Asn Ala Lys Gin Phe Glu Gly Lys Trp Val Thr Asp Tyr Ser Arg Thr 
845 850 855 860 

GAA GCC TTA TTT AAC TCT ACT TTT AAA CAA TCG CCT GAA AAT GCA TTA 4203 
Glu Ala Leu Phe Asn Ser Thr Phe Lys Gin Ser Pro Glu Asn Ala Leu 
865 870 875 

TAT GAT TTA AGC GAA TAC CTT TCT TTC TTT AAC GAT CCT ACG GAA TGG 4251 
Tyr Asp Leu Ser Glu Tyr Leu Ser Phe Phe Asn Asp Pro Thr Glu Trp 
880 885 890 

AAA GAA GGG CTA TTA CTG TTA AGC CGT TAT ATA GAT TAT GCT AAA GCA 4299 
Lys Glu Gly Leu Leu Leu Leu Ser Arg Tyr He Asp Tyr Ala Lys Ala 
895 900 905 

CAA GGA TTT TAT GAA AAC TGG GCG GCT ACT TCT AAC TTA ACT ATT GCC 4347 
Gin Gly Phe Tyr Glu Asn Trp Ala Ala Thr Ser Asn Leu Thr He Ala 
910 915 920 
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CGT TTA AGA GAG GCT GGA GTA ATT TTT GCA GAA TCG ACG GAT TTA AAA 
Arg Leu Arg Glu Ala Gly Val He Phe Ala Glu Ser Thr Asp Leu Lys 
925 930 935 940 



4395 



GGC GAT GAA AAA AAT AAT ATT TTG TTA GGT AGC CAA AAA GAT AAT AAC 
Gly Asp Glu Lys Asn Asn He Leu Leu Gly Ser Gin Lys Asp Asn Asn 
945 950 955 
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TTA TCG GGT AGT GCA GGT GAT GAT CTA CTT ATC GGC GGA GAG GGT AAT 
Leu Ser Gly Ser Ala Gly Asp Asp Leu Leu lie Gly Gly Glu Gly Asn 
960 965 970 
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GAT ACG TTA AAA GGC AGC TAC GGT GCA GAC ACC TAT ATC TTT AGC AAA 
Asp Thr Leu Lys Gly Ser Tyr Gly Ala Asp Thr Tyr lie Phe Ser Lys 
975 980 985 
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GGA CAC GGA CAG GAT ATC GTT TAT GAA GAT ACC AAT AAT GAT AAC CGC 
Gly His Gly Gin Asp He Val Tyr Glu Asp Thr Asn Asn Asp Asn Arg 
990 995 1000 
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GCA AGA GAT ATC GAC ACC TTA AAA TTT ACC GAT GTG AAT TAT GCG GAA 
Ala Arg Asp He Asp Thr Leu Lys Phe Thr Asp Val Asn Tyr Ala Glu 
1005 1010 1015 1020 
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GTG AAG TTT CGA CGA GTA GAT AAT GAC TTA ATG TTA TTC GGT TAT CAT 
Val Lys Phe Arg Arg Val Asp Asn Asp Leu Met Leu Phe Gly Tyr His 
1025 1030 1035 
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GAT ACG GAT TCG GTC ACG GTA AAA TCC TTC TAC AGC CAT GTA GAT TAT 
Asp Thr Asp Ser Val Thr Val Lys Ser Phe Tyr Ser His Val Asp Tyr 
1040 1045 1050 
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CAA TTT GAC AAA TTG GAG TTT GCT GAC CGC AGT ATA ACT CGC GAT GAA 
Gin Phe Asp Lys Leu Glu Phe Ala Asp Arg Ser He Thr Arg Asp Glu 
1055 1060 1065 
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CTG ATT AAA GCA GGG CTT CAT CTA TAC GGC ACC GAT GGC AAT GAT GAT 
Leu He Lys Ala Gly Leu His Leu Tyr Gly Thr Asp Gly Asn Asp Asp 
1070 1075 1080 
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ATA AAG GAT CAT GCG GAT TGG GAC AGC ATT TTG GAA GGC GGC AAA GGC 
He Lys Asp His Ala Asp Trp Asp Ser He Leu Glu Gly Gly Lys Gly 
1085 1090 1095 1100 
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AAC GAT ATT CTA AGA GGT GGC TAC GGT GCG GAC ACC TAT ATC TTT AGC 
Asn Asp He Leu Arg Gly Gly Tyr Gly Ala Asp Thr Tyr He Phe Ser 
1105 1110 1115 
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AAA 
Lys 


GGA 
Gly 


CAC 
His 


GGA CAG 
Gly Gin 
1120 


GAT 
Asp 


ATC 
He 


GTT 
Val 


TAT GAA 
Tyr Glu 
1125 


GAT 
Asp 


ACC 
Thr 


AAT 
Asn 


AAT GAT 
Asn Asp 
1130 


AAC 
Asn 


4971 


CGC 
Arg 


GCA 
Ala 


AGA GAT 
Arg Asp 
1135 


ATC 
He 


GAC 
Asp 


ACC 
Thr 


TTA AAA 
Leu Lys 
1140 


TTT 
Phe 


ACT 
Thr 


GAT 
Asp 


GTG AAT 
Val Asn 
1145 


TAT 
Tyr 


GCG 
Ala 


5019 


GAA 
Glu 


GTG AAA 
Val Lys 
1150 


TTC 
Phe 


CGA 
Arg 


CGA 
Arg 


GTA GAT 
Val Asp 
1155 


AAT 
Asn 


GAC 
Asp 


TTA 
Leu 


ATG TTA 
Met Leu 
1160 


TTC 
Phe 


GGT 
Gly 


TAT 
Tyr 


5067 


CAT GAT ACG GAT 
His Asp Thr Asp 
1165 


TCG 
Ser 


GTC ACG 
Val Thr 
1170 


ATA 
He 


AAA 
Lys 


TCC 
Ser 


TTC TAC AAC CAT 
Phe Tyr Asn His 
1175 


GTA 
Val 


GAT 
Asp 
1180 


5115 


TAT 
Tyr 


CAA 
Gin 


TTT 
Phe 


GAC 
Asp 


AAA TTG 
Lys Leu 
1185 


GAA 
Glu 


TTT 
Phe 


GCT 
Ala 


GAC CGC AGT 
Asp Arg Ser 
1190 


ATA 
He 


ACT 
Thr 


CGT GAT 
Arg Asp 
1195 


5163 
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GAA CTA GGT AAA CAA GGT ATG GCA TTA TTT GGC ACT GAC GGT GAT GAT 5211 
Glu Leu Gly Lys Gin Gly Met Ala Leu Phe Gly Thr Asp Gly Asp Asp 
1200 1205 1210 

AAT ATC AAC GAC TOG GGA CGT AAC TCG GTG ATT GAT GCC GGT GCG GGT 5259 
Asn He Asn Asp Trp Gly Arg Asn Ser Val He Asp Ala Gly Ala Gly 
1215 1220 1225 

AAT GAT ACG GTT AAT GGC GGT AAT GGC GAT GAC ACC CTC ATC GGC GGC 5307 
Asn Asp Thr Val Asn Gly Gly Asn Gly Asp Asp Thr Leu He Gly Gly 
1230 1235 1240 

AAA GGT AAT GAT ATT CTA AGA GGT GGC TAC GGT GCG GAC ACC TAT ATC 5355 
Lys Gly Asn Asp He Leu Arg Gly Gly Tyr Gly Ala Asp Thr Tyr He 
1245 1250 1255 1260 

TTT AGC AAA GGA CAC GGA CAG GAT ATC GTT TAT GAA GAT ACC AAT AAT 5403 
Phe Ser Lys Glv His Glv Gin Asp He Val Tyr Glu Asp Thr Asn Asn 
1265 1270 1275 

GAT AAC CGC GCA AGA GAT ATC GAC ACC TTA AAA TTT ACC GAT GTG AAT 5451 
Asp Asn Arg Ala Arg Asp He Asp Thr Leu Lys Phe Thr Asp Val Asn 
1280 1285 1290 

TAT GCG GAA GTG AAA TTC CGA CGA GTA GAT AAT GAC TTA ATG TTA TTC 5499 
Tyr Ala Glu Val Lys Phe Arg Arg Val Asp Asn Asp Leu Met Leu Phe 
■ 1295 1300 1305 

GGT TAT CAT GAT ACG GAT TCG GTC ACG GTA AAA TCC TTC TAC AGC CAT 5547 
Gly Tyr His Asp Thr Asp Ser Val Thr Val Lys Ser Phe Tyr Ser His 
1310 1315 1320 

GTA GAT TAT CAA TTT GAC AAA TTG GAG TTT GCT GAC CGC AGT ATA ACT 5595 
Val Asp Tyr Gin Phe Asp Lys Leu Glu Phe Ala Asp Arg Ser He Thr 
1325 1330 1335 1340 

CGC GAT GAA CTG ATT AAA GCA GGG CTT CAT CTA TAC GGC ACC GAT GGC 5643 
Ara Asp Glu Leu He Lys Ala Gly Leu His Leu Tyr Gly Thr Asp Gly 
1345 1350 1355 

AAT GAT GAT ATA AAG GAT CAT GCG GAT TGG GAC AGC ATT TTG GAA GGC 5691 
Asn Asp Asp He Lys Asp His Ala Asp Trp Asp Ser He Leu Glu Gly 
1360 1365 1370 

GGC AAA GGC AAC GAT ATT CTA AGA GGT GGC TAC GGT GCG GAC ACC TAT 5739 
Gly Lys Gly Asn Asp He Leu Arg Gly Gly Tyr Gly Ala Asp Thr Tyr 
1375 1380 1385 

ATC TTT AGC AAA GGA CAC GGA CAG GAT ATC GTT TAT GAA GAT ACC AAT 5787 
He Phe Ser Lys Gly His Gly Gin Asp He Val Tyr Glu Asp Thr Asn 
1390 1395 1400 

AAT GAT AAC CGA GCA AGA GAT ATC GAC ACC TTA AAA TTT ACT GAT GTG 5835 
Asn Asp Asn Arg Ala Arg Asp He Asp Thr Leu Lys Phe Thr Asp Val 
1405 1410 1415 1420 

AAT TAT GCG GAA GTG AAA TTC CGA CGA GTA GAT AAT GAC TTA ATG TTA 5883 
Asn Tyr Ala Glu Val Lys Phe Arg Arg Val Asp Asn Asp Leu Met Leu 
1425 1430 1435. 

TTC GGT TAT CAT GAT ACG GAT TCG GTC ACG ATA AAA TCC TTC TAC AAC 5931 
Phe Glv Tyr His Asp Thr Asp Ser Val Thr He Lys Ser Phe Tyr Asn 
1440 1445 1450 

CAT GTA GAT TAT CAA TTT GAC AAA TTG GAA TTT GCT GAC CGC AGT ATA 5979 
His Val Asp Tyr Gin Phe Asp Lys Leu Glu Phe Ala Asp Arg Ser He 
1455 1460 1465 
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ACT CGT GAT GAA CTA GGT AAA CAA GGT ATG GCA TTA TTT GGC ACT GAC 6027 
Thr Arg Asp Glu Leu Gly Lys Gin Gly Met Ala Leu Phe Gly Thr Asp 
1470 1475 1480 

GGT GAT GAT AAT ATC AAC GAC TGG GGA CGT AAC TCG GTG ATT GAT GCC 6075 
Gly Asp Asp Asn lie Asn Asp Trp Gly Arg Asn Ser Val lie Asp Ala 
1485 1490 1495 1500 

GGT GCG GGT AAT GAT ACG GTT AAT GGC GGT AAT GGC GAT GAC ACC CTC 6123 
Gly Ala Gly Asn Asp Thr Val Asn Gly Gly Asn Gly Asp Asp Thr Leu 
1505 1510 1515 

ATC GGC GGC AAA GGT AAT GAT ATT CTA AGA GGT GGC TAC GGT GCG GAC 6171 
lie Gly Gly Lys Gly Asn Asp lie Leu Arg Gly Gly Tyr Gly Ala Asp 
1520 1525 1530 

ACC TAT ATC TTT AGC AAA GGA CAC GGA CAG GAT ATC GTT TAT GAA GAT 6219 
Thr Tyr lie Phe Ser Lys Gly His Gly Gin Asp lie Val Tyr Glu Asp 
1535 1540 " 1545 

ACC AAT AAT GAT AAC CGC GCA AGA GAT ATC GAC ACC TTA AAA TTT ACT 6267 
Thr Asn Asn Asp Asn Arg Ala Arg Asp lie Asp Thr Leu Lys Phe Thr 
1550 1555 1560 

GAT ATT AAT TTA TCC GAA CTT TGG TTT AGC CGA GAA AAT AAC GAT TTG 6315 
Asp lie Asn Leu Ser Glu Leu Trp Phe Ser Arg Glu Asn Asn Asp Leu 
1565 1570 1575 1580 

ATT ATT AAA TCA TTA TTA AGT GAG GAT AAA GTC ACG GTT CAA AAT TGG 6363 
lie lie Lys Ser Leu Leu Ser Glu Asp Lys Val Thr Val Gin Asn Trp 
1585 1590 1595 

TAT TCA CAC CAA GAT CAT AAA ATA GAA AAT ATT CGT TTA TCG AAT GAG 6411 
Tyr Ser His Gin Asp His Lys lie Glu Asn lie Arg Leu Ser Asn Glu 
1600 1605 1610 

CAA ACG TTG GTG AGC ACT CAG GTG GAG AAG ATG GTT GAG TCG ATG GCC 6459 
Gin Thr Leu Val Ser Thr Gin Val Glu Lys Met Val Glu Ser Met Ala 
1615 1620 1625 

GGC TTT GCT CAG AAG CAC GGA GGA GAG ATA TCT CTT GTG TCG CTT GAA 6507 
Gly Phe Ala Gin Lys His Gly Gly Glu lie Ser Leu Val Ser Leu Glu 
1630 1635 1640 

GAG GTA AAA CAA TAT ATC AAT AGC TTA ACA GCT GCT TTA TAA 6549 
Glu Val Lys Gin Tyr lie Asn Ser Leu Thr Ala Ala Leu ♦ 
1645 1650 1655 

CATACGAAAG AAATCGGCAC AGTTTTTTTG AACTGTGCCG ATTTGATTTT AGTGTAAGAA 6609 

TATAGCCTGA TTTTAAGAAA TTTACTCTTG GCTAATAACT ATTTCCCATT TTATAAGTTA 6669 

TTGACGGATG GTTTTATCAA ATATGAGATC AAATCTTATT TTAAATTCGC TTTCCATTAA 6729 

GCGATAT €736 
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(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1658 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Met Ser Asp Asn Ala Phe Phe Val lie Glu Glu Ser Gly Lys Arg Tyr 
15 10 15 

lie Glu Asn Phe Gly lie Glu Pro Leu Gly Lys Gin Glu Asp Phe Asp 
20 25 30 

Phe Val Gly Gly Phe Trp Ser Asn Leu Val Asn Arg Gly Leu Glu Ser 
35 40 45 

lie lie Asp Pro Ser Gly lie Gly Gly Thr Val Asn Leu Asn Phe Thr 
50 55 60 

Gly Glu Val Glu Thr Tyr Thr Leu Asp Glu Thr Arg Phe Lys Ala Glu 
65 70 75 80 

Ala Ala Lys Lys Ser His Trp Ser Leu Val Asn Ala Ala Lys Val Tyr 
85 90 95 

Gly Gly Leu Asp Gin lie lie Lys Lys Leu Trp Asp Ser Gly Ser lie 
100 105 110 

Lys His Leu Tyr Gin Asp Lys Asp Thr Gly Lys Leu Lys Pro lie lie 
115 120 125 

Tyr Gly Thr Ala Gly Asn Asp Ser Lys lie Glu Gly Thr Lys He Thr 
130 135 140 

Arg Arg lie Ala Gly Lys Glu Val Thr lieu Asp He Ala Asn Gin Lys 
145 150 155 160 

He Glu Lys Gly Val Leu Glu Lys Leu Gly Leu Ser Val Ser Gly Ser 
165 170 175 

Asp He He Lys Leu Leu Phe Gly Ala Leu Thr Pro Thr Leu Asn Arg 
180 185 190 

Met Leu Leu Ser Gin Leu He Gin Ser Phe Ser Asp Ser Leu Ala Lys 
195 200 205 

Leu Asp Asn Pro Leu Ala Pro Tyr Thr Lys Asn Gly Val Val Tyr Val 
210 215 220 

Thr Gly Lys Gly Asn Asp Val Leu Lys Gly Thr Glu His Glu Asp Leu 
225 230 235 240 

Phe Leu Gly Gly Glu Gly Asn Asp Thr Tyr Tyr Ala Arg Val Gly Asp 
245 250 255 

Thr He Glu Asp Ala Asp Gly Lys Gly Lys Val Tyr Phe Val Arg Glu 

260 265 270 

Lys Gly Val Pro Lys Ala Asp Pro Lys Arg Val Glu Phe Ser Glu Tyr 
275 280 285 

He Thr Lys Glu Glu He Lys Glu Val Glu Lys Gly Leu Leu Thr Tyr 

290 295 300 

Ala Val Leu Glu Asn Tyr Asn Trp Glu Glu Lys Thr Ala Thr Phe Ala 
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His Ala Thr Met Leu Asn Glu Leu Phe Thr Asp Tyr Thr Asn Tyr Arg 
325 330 335 

Tyr Glu Val Lys Gly Leu Lys Leu Pro Ala Val Lys Lys Leu Lys Ser 
340 345 350 

Pro Leu Val Glu Phe Thr Ala Asp Leu Leu Thr Val Thr Pro lie Asp 
355 360 365 

Glu Asn Gly Lys Ala Leu Ser Glu Lys Ser lie Thr Val Lys Asn Phe 
370 375 380 

Lys Asn Gly Asp Leu Gly He Arg Leu Leu Asp Pro Asn Ser Tyr Tyr 
385 390 395 400 

Tyr Phe Leu Glu Gly Gin Asp Thr Gly Phe Tyr Gly Pro Ala Phe Tyr 
405 410 415 

He Glu Arg Lys Asn Gly Gly Gly Ala Lys Asn Asn Ser Ser Gly Ala 
420 425 430 

Gly Asn Ser Lys Asp Trp Gly Gly Asn Gly His Gly Asn His Arg Asn 
435 440 445 

Asn Ala Ser Asp Leu Asn Lys Pro Asp Gly Asn Asn Gly Asn Asn Gin 
450. 455 460 

Asn Asn Gly Ser Asn Gin Asp Asn His Ser Asp Val Asn Ala Pro Asn 
465 470 475 480 

Asn Pro Gly Arg Asn Tyr Asp He Tyr Asp Pro Leu Ala Leu Asp Leu 
485 490 495 

Asp Gly Asp Gly Leu Glu Thr Val Ser Met Asn Gly Arg Gin Gly Ala 

500 505 510 

Leu Phe Asp His Glu Gly Lys Gly He Arg Thr Ala Thr Gly Trp Leu 
515 520 525 

Ala Ala Asp Asp Gly Phe Leu Val Leu Asp Arg Asn Gin Asp Gly He 
530 535 540 

He Asn Asp He Ser Glu Leu Phe Ser Asn Lys Asn Gin Leu Ser Asp 
545 550 555 560 

Gly Ser He Ser Ala His Gly Phe Ala Thr Leu Ala Asp Leu Asp Thr 
565 570 575 

Asn Gin Asp Gin Arg He Asp Gin Asn Asp Lys Leu Phe Ser Lys Leu 
580 585 590 

Gin He Trp Arg Asp Leu Asn Gin Asn Gly Phe Ser Glu Ala Asn Glu 
595 600 605 

Leu Phe Ser Leu Glu Ser Leu Asn He Lys Ser Leu His Thr Ala Tyr 

610 615 620 

Glu Glu Arg Asn Asp Phe Leu Ala Gly Asn Asn He Leu Ala Gin Leu 
625 630 635 640 

Gly Lys Tyr Glu Lys Thr Asp Gly Thr Phe Ala Gin Met Gly Asp Leu 
645 650 655 

Asn Phe Ser Phe Asn Pro Phe Tyr Ser Arg Phe Thr Glu Ala Leu Asn 
660 665 670 

Leu Thr Glu Gin Gin Arg Arg Thr He Asn Leu Thr Gly Thr Gly Arg 

^7k; icon foe 
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Val Arg Asp Leu Arg Glu Ala Ala Ala Leu Ser Glu Glu Leu Ala Ala 
690 695 700 

Leu Leu Gin Gin Tyr Thr Lys Ala Ser Asp Phe Gin Ala Gin Ara Glu 
705 710 715 ^ 720 

Leu Leu Pro Ala He Leu Asp Lys Trp Ala Ala Thr Asp Leu Gin Tvr 
725 730 735 

Gin His Tyr Asp Lys Thr Leu Leu Lys Thr Val Glu Ser Thr Asd Ser 
740 745 750 

Ser Ala Ser Val Val Arg Val Thr Pro Ser Gin Leu Ser Ser He Aro 
755 760 765 

Asn Ala Lys His Asp Pro Thr Val Met Gin Asn Phe Glu Gin Ser Lva 
770 775 780 

Ala Lys He Ala Thr Leu Asn Ser Leu Tyr Qly Leu Asn He Asp Gin 
785 790 795 *^ eOO 

Leu Tyr Tyr Thr Thr Asp Lys Asp He Arg Tyr He Thr Asp Lys Val 
805 810 815 

Asn Asn Met Tyr Gin Thr Thr Val Glu Leu Ala Tyr Arg Ser Leu Leu 
820 825 830 

Leu Gin Thr Arg Leu Lys Lys Tyr Val Tyr Ser Val Asn Ala Lys Gin 
835 840 845 

Phe Glu Gly Lys Trp Val Thr Asp Tyr Ser Arg Thr Glu Ala Leu Phe 
850 855 860 

Asn Ser Thr Phe Lys Gin Ser Pro Glu Asn Ala Leu Tyr Asd Leu Ser 
865 870 875 *^ 880 

Glu Tyr Leu Ser Phe Phe Asn Asp Pro Thr Glu Trp Lys Glu Gly Leu 
885 890 895 

Leu Leu Leu Ser Arg Tyr He Asp Tyr Ala Lys Ala Gin Gly Phe Tyr 
900 905 910 

Glu Asn Trp Ala Ala Thr Ser Asn Leu Thr He Ala Arg Leu Arg Glu 
915 920 925 

Ala Gly Val He Phe Ala Glu Ser Thr Asp Leu Lys Gly Asp Glu Lvs 
930 935 940 

Asn Asn He Leu Leu Gly Ser Gin Lys Asp Asn Asn Leu Ser Glv Ser 
945 950 955 960 

Ala Gly Asp Asp Leu Leu He Gly Gly Glu Gly Asn Asp Thr Leu Lys 
965 970 975 

Gly Ser Tyr Gly Ala Asp Thr Tyr He Phe Ser Lys Gly His Gly Gin 
980 985 990 

Asp He Val Tyr Glu Asp Thr Asn Asn Asp Asn Arg Ala Arg Asp He 
995 1000 1005 

Asp Thr Leu Lys Phe Thr Asp Val Asn Tyr Ala Glu Val Lys Phe Ara 
1010 1015 1020 

Arg Val Asp Asn Asp Leu Met Leu Phe Gly Tyr His Asp Thr Asp Ser 
1025 1030 1035 1040 

Val Thr Val Lys Ser Phe Tyr Ser His Val Asp Tyr Gin Phe Asp Lys 
1045 1050 lOfic; 
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Leu Glu Phe Ala Aap Arg Ser He Thr A^g Asp Glu Leu He Lys Ala 
1060 1065 1070 

Gly Leu His Leu Tyr Gly Thr Asp Gly Aan Asp Asp He Lys Asp His 
1075 1080 1085 

1090^"^ 1095°^" iJoo'^" 

Arg Gly Gly Tyr Gly Ala Asp Thr Tyr He Phe Ser Lys Gly His Glv 
1105 1110 1115 ^^r^ 

Gin Asp He Val Tyr Glu Asp Thr Asn Asn Asp Asn Arg Ala Ar^ Asp 
H2S 1130 1135 

He Asp Thr Leu Lys Phe Thr Asp Val Asn Tyr Ala Glu Val Lys Phe 
1140 1145 1150 

Arg Arg Val Asp Asn Asp Leu Met Leu Phe Gly Tyr His Asp Thr Aan 
1155 1160 1165 ^ 

Ser Val Thr He Lys Ser Phe Tyr Asn His Val Asp Tyr Gin Phe Asp 
1170 1175 1180 

Lys Leu Glu Phe Ala Asp Arg Ser He Thr Arg Asp Glu Leu Gly Lvs 
1185 - 1190 1195 j^Ipq 

Gin Gly Met Ala Leu Phe Gly Thr Asp Gly Asp Asp Asn He Asn Asd 
"05 1210 1215 ^ 

Trp Gly Arg Asn Ser Val He Asp Ala Gly Ala Gly Asn Asp Thr Val 
1220 1225 1230 

Asn Gly Gly Asn Gly Asp Asp Thr Leu He Gly Gly Lys Gly Asn Asp 
1235 1240 1245 

He Leu Arg Gly Gly Tyr Gly Ala Asp Thr Tyr He Phe Ser Lvs Glv 
1250 1255 1260 ^ 

His Gly Gin Asp He Val Tyr Glu Asp Thr Asn Asn Asp Asn Arg Ala 
1265 1270 1275 i280 

Arg Asp He Asp Thr Leu Lys Phe Thr Aap Val Asn Tyr Ala Glu Val 
1285 1290 1295 

Lys Phe Arg Arg Val Asp Asn Asp Leu Met Leu Phe Gly Tyr Hia Asn 
1300 1305 1310 

Thr Asp Ser Val Thr Val Lys Ser Phe Tyr Ser His Val Asp Tvr Gin 
1315 1320 1325 

Phe Asp Lys Leu Glu Phe Ala Asp Arg Ser He Thr Arg Asp Glu Leu 
1330 1335 1340 

He Lys Ala Gly Leu His Leu Tyr Gly Thr Aap Gly Asn Asp Asp He 
1345 1350 1355 3^35^ 

Lya Aap His Ala Asp Trp Aap Ser He Leu Glu Gly Gly Lys Gly Asn 
1365 1370 1375 

Aap He Leu Arg Gly Gly Tyr Gly Ala Asp Thr Tyr He Phe Ser Lvs 
1380 1385 1390 

Gly Hia Gly Gin Aap He Val Tyr Glu Aap Thr Asn Asn Asp Asn Arc 
1395 1400 1405 

Ala Arg Asp He Asp Thr Leu Lys Phe Thr Asp Val Asn Tvr Ala Glu 
1410 1415 1420 
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Val Lys Phe Arg Arg Val Asp Asn Asp Leu Met Leu Phe Gly Tyr His 
1425 1430 1435 X440 

Asp Thr Asp Ser Val Thr lie Lys Ser Phe Tyr Asn His Val Asp Tyr 
1445 1450 X455 

Gin Phe Asp Lys Leu Glu Phe Ala Asp Arg Ser He Thr Arg Asp Glu 
1460 1465 

Leu Gly Lys Gin Gly Met Ala Leu Phe Gly Thr Asp Gly Asp Asp Asn 
1475 1480 1485 

He Asn Asp Trp Gly Arg Asn Ser Val He Asp Ala Gly Ala Gly Asn 
1490 1495 1500 

Asp Thr Val Asn Gly Gly Asn Gly Asp Asp Thr Leu He Gly Glv Lvs 
1505 1510 1515 ^ 

Gly Asn Asp He Leu Arg Gly Gly Tyr Gly Ala Asp Thr Tyr He Phe 
1525 1530 1535 

Ser Lys Gly His Gly Gin Asp He Val Tyr Glu Asp Thr Asn Asn Asn 
1540 1545 1550 

Asn Arg Ala Arg Asp He Asp Thr Leu Lys Phe Thr Asp He Asn Leu 
- 1555 1560 1565 

Ser Glu Leu Trp Phe Ser Arg Glu Asn Asn Asp Leu He He Lvs Ser 
1570 1575 1580 

Leu Leu Ser Glu Asp Lys Val Thr Val Gin Asn Trp Tyr Ser His Gin 
1585 1590 1595 16OO 

Asp His Lys He Glu Asn He Arg Leu Ser Asn Glu Gin Thr Leu Val 
1605 1610 1615 

Ser Thr Gin Val Glu Lys Met Val Glu Ser Met Ala Gly Phe Ala Gin 
1620 1625 1630 

Lys His Gly Gly Glu He Ser Leu Val Ser Leu Glu Glu Val Lys Gin 
1635 1640 1645 

Tyr He Asn Ser Leu Thr Ala Ala Leu * 
1650 1655 



38 



(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7004 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEONESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI- SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Actinobacillus pleuropneumoniae 

(B) STRAIN: HV114 (serotype 3 field strain) 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: pROKS 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1566. .5714 

(D) OTHER INFORMATION: /codon_start= 1566 
/functiono "RTX- toxin" 
/product* "ApxIV_var3" 
/genes •'apxIV_var3" 
/numbers 1 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 



ATCGATATGC 


CGCCGGGTAC 


GGGCGATATC 


CAACTTACTC 


TTTCGCAACA 


AATTCCGGTT 


60 


ACCGGTGCGG 


TAGTGGTAAC 


TACTCCGCAA 


GATATTGCGT 


TATTAGATGC 


GGTGAAAGGT 


120 


ATTTCAATGT 


TCCAAAAAGT 


GTCGGTACCG 


GTCTTAGGTA 


TCATTGAAAA 


TATGAGCGTG 


180 


CATATCTGCC 


AAAATTGCGG 


TCACCACGAA 


GATATTTTCG 


GCACCGGCGG 


TGCGGAGAAA 


240 


GTGGCGAAGA 


AATACGGTAC 


TAAAGTATTA 


GGACAAATGC 


CGTTGCATAT 


TCGCTTACGT 


300 


CAAGATTTGG 


ATGCCGGCAC 


ACCGACCGTC 


GTTGCGGCAC 


CGGAACACGA 


CACCAGCAGA 


360 


GCCTATATTG 


AATTAGCGGC 


AAAAGTCGCT 


TCGGAATTAT 


ACTGGCAAGG 


TTCGGTTATC 


420 


CCGTCTGAAA 


TTATGATTCG 


TGAAGTAAAA 


TAAGCCTACA 


TAACCACGGA 


ATACCAGATA 


480 


ACACAGAAGG 


AAAACTVAGCG 


GTAGAATTTG 


CAGAAAAAGT 


TGCAAATTCT 


ACCGCTTTTT 


540 


TATTAGTACG 


ATTCGCTGTT 


GGACTGCCAT 


TTGATTTGGT 


TTGTCAGGAT 


ATTATGTTAT 


600 


TGTAATGAAA 


TGTTAGTGAA 


TTATTTTTAT 


TAATTTGAAA 


GGAGACAAAA 


TGAAAATAAA 


660 


AAAACGTTAC 


ATTGCGCTGC 


TAGCTTTAGG 


CAGTGTTATT 


GGCTATGCCT 


GGTATCAAAA 


720 


TTATCAATGG 


GAACAGTTGA 


TGTTAAGTGG 


CTATTGTGAA 


AAGGACGGAA 


GCTATTGTGA 


780 


TGATAGGCAT 


ACGAAGCAGG 


AACTGATTGA 


TAGGGCAATT 


AACTATGTGC 


TGGAAAATCA 


840 


AATTCAACAG 


ACATATGAAG 


GTGATGACCT 


TGTGGATATA 


AAACAATATT 


CAACAATAGA 


900 


GGAATTTAAA 


AAACTAAATC 


CGAATTGTTG 


TAAGGTAGAT 


TCTTGGCCGG 


ATQATGCTGT 


960 


TCGTGAGGAT 


GCTGATTTAC 


AGCGAGAGGG 


CAAAGCGTAT 


AAATACGTAA 


AAGTCAAATA 


1020 
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TAAl. i vsUAvj 1 






Av^ilaXAl lAA 


AAA r AGATT A 


GATGGAGACA 


1140 


ACACGATGAC 


AAAACTAACT 


A 1 V-UAAVjATCs 


TGACTAATTT 


ATArrrATAT 


AAGCAAAGAA 


1200 


CTTTACCTAC 


GGATAGGTTA 




TTATTAGCAA 


AACAGGAAAA 


GGGGAAAATA 


1260 


TTGATAAAAA 


GGAATTTATG 


GCGGGGCCGG 


GACGTTTTGT 


GACGGCCGAT 


AATTTTAGTG 


1320 


TTGTAAAAGA 


eiTTTTTACT 


GCAAAGGATT 


CATTAATAAA 


CCTAAGCTTG 


CAGACTCGTA 


1380 


TATTAGCXym 


TTTAAAGCCG 


GGCAAATATT 


CCAAAGCGCA 


GATATTAGAA 


ATGTTGGGCT 


1440 


ATACGAAAAA 


TGGAGAAAAG 


GTAGATGGCA 


TGTTTACCGG 


TGAAGTCCAG 


ACATTAGGCT 


1500 


TTTATGACGA 


TGGCAAAGGG 


GATTTACTCG 


AACGCGCCTA 


TATCTGAAAT 


ACCACAGGAT 


1560 


TTAAA ATG AGC GAG AAT 
Met Ser Asp Asn 


GCC TTT TTT GTT ATA GAA GAA TCA GGC AAA 
Ala Phe Phe Val He Glu Glu Ser Gly Lys 


1607 



1660 



1665 



1670 



CGC TAT ATT GAA AAC TTT GGT ATT GAA CCT CTT GGT AAG CAA GAA GAT 
Arg Tyr He Glu Asn Phe Gly He Glu Pro Leu Gly Lys Gin Glu Asp 
1675 1680 1685 



1655 



TTT GAT TTT GTC GGC GGC TTT TGG TCT AAC TTA GTG AAT CGT GGT TTG 
Phe Asp Phe Val Gly Gly Phe Trp Ser Asn Leu Val Asn Arg Gly Leu 
1690 1695 1700 



1703 



GAA AGT ATT ATC GAC CCA TCC GGT ATC GGT GGA ACG GTA AAC CTT AAC 
Glu Ser He He Asp Pro Ser Gly He Gly Gly Thr Val Asn Leu Asn 
1705 1710 1715 1720 



1751 



TTT ACC GGC GAG GTG GAA ACC TAC ACG TTA GAC GAA ACA AGG TTT AAA 
Phe Thr Gly Glu Val Glu Thr Tyr Thr Leu Asp Glu Thr Arg Phe Lys 
1725 1730 1735 



1799 



GCG GAA GCG GCG AAG AAA AGC CAT TGG AGT TTA GTG AAT GCG GCG AAA 
Ala Glu Ala Ala Lys Lys Ser His Trp Ser Leu Val Asn Ala Ala Lys 
1740 1745 1750 



1847 



GTA TAC GGC GGT TTA GAC CAA ATT ATT AAA AAA CTA TGG GAC AGT GGC 
Val Tyr Gly Gly Leu Asp Gin He He Lys Lys Leu Trp Asp Ser Gly 
1755 1760 1765 



1895 



TCA ATT AAG CAT TTA TAT CAA GAT AAA GAT ACG GGC AAA TTA AAA CCG 
Ser He Lys His Leu Tyr Gin Asp Lys Asp Thr Gly Lys Leu Lys Pro 
1770 1775 1780 



1943 



ATT ATT TAC GGC ACG GCC GGC AAC GAC AGT AAG ATT GAA GGC ACT AAA 
He He Tyr Gly Thr Ala Gly Asn Asp Ser Lys He Glu Gly Thr Lys 
1785 1790 1795 1800 



1991 



ATC ACC CGT AGG ATT GCG GGT AAA GAA GTT ACG CTT GAT ATT GCC AAT 
He Thr Arg Arg He Ala Gly Lys Glu Val Thr Leu Asp He Ala Asn 
1805 1810 1815 



2039 



CAG AAA ATT GAA AAA GGC GTG TTA GAG AAA TTG GGG CTG TCT GTT AGT 
Gin Lys He Glu Lys Gly Val Leu Glu Lys Leu Gly Leu Ser Val Ser 
1820 1825 1830 



2087 



GGT TCG GAT ATC ATT AAA TTG TTG TTT GGA GCA TTG ACT CCA ACT TTA 
Gly Ser Asp He He Lys Leu Leu Phe Gly Ala Leu Thr Pro Thr I^eu 
1835 1840 1845 



2135 



AAT AGA ATG TTG CTA TCA CAA CTT ATC CAG TCT TTT TCC GAT AGC TTG 
Asn Arg Met Leu Leu Ser Gin Leu He Gin Ser Phe Ser Asp Ser Leu 
1850 1855 1860 



2183 



40 



GCT AAA CTT GAT AAT CCC TTA GCC CCT TAC ACT AAA AAT GGC GTG GTT 
Ala Lys Leu Asp Asn Pro Leu Ala Pro Tyr Thr Lys Asn Gly Val Val 
1865 1870 1875 i880 

TAT GTC ACC GGC AAA GGG AAT GAT GTG CTT AAA GGA ACT GAA CAT GAG 
Tyr Val Thr Gly Lys Gly Asn Asp Val Leu Lys Gly Thr Glu His Glu 
1885 1890 1895 

GAT TTG TTT CTC GGT GGT GAG GGG AAT GAT ACT TAT TAT GCG AGA GTA 
Asp Leu Phe Leu Gly Gly Glu Gly Asn Asp Thr Tyr Tyr Ala Arg Val 
1900 1905 1910 

GGC GAT ACA ATT GAA GAC GCC GAC GGC AAA GGT AAA GTC TAT TTT GTG 
Gly Asp Thr He Glu Asp Ala Asp Gly Lys Gly Lys Val Tyr Phe Val 
1915 1920 1925 

AGA GAA AAA GGG GTA CCT AAG GCG GAT CCT AAG CGG GTA GAG TTT AGC 
Arg Glu Lys Gly Val Pro Lys Ala Asp Pro Lys Ara Val Glu Phe Ser 
1930 1935 j^^Iq 

GAG TAC ATA ACG AAA GAA GAA ATA AAA GAG GTT GAA AAG GGG TTA TTA 
Glu Tyr He Thr Lys Glu Glu He Lys Glu Val Glu Lys Gly Leu Leu 
1945 1950 1955 i960 

ACT TAC GCA GTT TTA GAA AAT TAT AAT TGG GAA GAG AAA ACG GCG ACT 
Thr Tyr Ala Val Leu Glu Asn Tyr Asn Trp Glu Glu Lys Thr Ala Thr 
1965 1970 1975 

TTC GCT CAT GCG ACT ATG CTT AAT GAG CTT TTT ACT GAT TAT ACT AAT 
Phe Ala His Ala Thr Met Leu Asn Glu Leu Phe Thr Asp Tyr Thr Asn 
1980 1985 1990 

TAT CGT TAT GAA GTT AAA GGA CTA AAA TTG CCC GCC GTT AAA AAG TTA 
Tyr Arg Tyr Glu Val Lys Gly Leu Lys Leu Pro Ala Val Lys Lys Leu 
1995 2000 2005 

AAA AGT CCG TTG GTG GAG TTT ACA GCT GAT TTA TTA ACT GTT ACG CCT 
Lys Ser Pro Leu Val Glu Phe Thr Ala Asp Leu Leu Thr Val Thr Pro 
2010 2015 2020 

ATT GAC GAA AAC GGA AAA GCA CTT AGC GAA AAA AGT ATT ACG GTT AAA 
He Asp Glu Asn Gly Lys Ala Leu Ser Glu Lys Ser He Thr Val Lys 
2025 2030 2035 2040 

AAT TTT AAA AAT GGT GAT TTA GGA ATA AGG TTG TTG GAT CCT AAT AGC 
Asn Phe Lys Asn Gly Asp Leu Gly He Arg Leu Leu Asp Pro Asn Ser 
2045 2050 2055 

TAT TAT TAT TTC CTT GAA GGC CAA GAT ACG GGT TTT TAT GGT CCT GCT 
Tyr Tyr Tyr Phe Leu Glu Gly Gin Asp Thr Gly Phe Tyr Gly Pro Ala 
2060 2065 2070 

TTT TAT ATT GAA CGA AAA AAC GGT GGA GGC TCT AAA AAT AAC TCG TCG 
Phe Tyr He Glu Arg Lys Asn Gly Gly Gly Ser Lys Asn Asn Ser Ser 
2075 2080 2085 

GGA GCA GGA AAT AGC AAA GAT TGG GGC GGG AAC GGG CAT GGA AAT CAC 
Gly Ala Gly Asn Ser Lys Asp Trp Gly Gly Asn Gly His Gly Asn His 
2090 2095 2100 



2231 



2279 



2327 



2375 



2423 



2471 



2519 



2567 



2615 



2663 



2711 



2759 



2807 



2855 



2903 



CGA 


AAT 


AAT 


GCC 


TCC 


GAC CTG AAT AAA 


CCG 


GAC GGA AAT AAT 


GGG 


AAT 


Arg 


Asn 


Asn 


Ala 


Ser 


Asp Leu Asn Lys 


Pro 


Asp Gly Asn Asn 


Gly 


Asn 


2105 








2110 




2115 


2120 


AAC 


CAA 


AAT 


AAC 


GGA 


AGC AAT CAA GAT 


AAT 


CAT AGC GAT GTG 


AAT 


GCG 


Asn 


Gin 


Asn 


Asn 


Gly 


Ser Asn Gin Asp 


Asn 


His Ser Asp Val 


Asn 


Ala 



2125 



2130 



2135 



2951 



2999 



3047 



3095 



3143 



3191 



3239 



3287 



3335 



41 



CCA AAT AAC CCG GGA CGT AAC TAT GAT ATT TAC GAT CCT TTA GCT TTA 
Pro Asn Asn Pro Gly Arg Asn Tyr Asp lie Tyr Asp Pro Leu Ala Leu 
2140 2145 2150 

GAT TTA GAT GGA GAT GGG CTT GAA ACC GTG TCG ATG AAC GGG CGA CAA 
Asp Leu Asp Gly Asp Gly Leu Glu Thr Val Ser Met Asn Gly Arg Gin 
2155 2160 2165 

GGC GCG TTA TTC GAT CAT GAA GGA AAA GGT ATT CGT ACC GCA ACG GGC 
Gly Ala Leu Phe Asp His Glu Gly Lys Gly lie Arg Thr Ala Thr Glv 
2170 2175 2180 

TGG CTC GCT GCG GAT GAC GGT TTT TTA GTG TTA GAT CGT AAC CAA GAC 
Trp Leu Ala Ala Asp Asp Gly Phe Leu Val Leu Asp Arg Asn Gin Asd 
2185 2190 2195 2200 

GGC ATT ATT AAT GAT ATA AGC GAG TTA TTT AGT AAT AAA AAT CAA CTT 
Gly lie He Asn Asp He Ser Glu Leu Phe Ser Asn Lys Asn Gin Leu 
2205 2210 2215 

TCC GAC GGG AGT ATT TCT GCA CAC GGT TTT GCG ACA TTA GCC GAT TTG 
Ser Asp Gly Ser He Ser Ala His Gly Phe Ala Thr Leu Ala Asp Leu 
2220 2225 2230 

GAT ACA AAC CAA GAT CAG CGT ATC GAC CAA AAT GAT AAG CTG TTT TCT 
Asp Thr Asn Gin Asp Gin Arg He Asp Gin Asn Asp Lys Leu Phe Ser 
2235 2240 2245 

AAA CTC CAA ATT TGG CGG GAT TTA AAT CAA AAC GGT TTT AGT GAA GCG 3383 
Lys Leu Gin He Trp Arg Asp Leu Asn Gin Asn Gly Phe Ser Glu Ala 
2250 2255 2260 

AAT GAG CTG TTT AGC TTA GAA AGT TTG AAT ATT AAA TCT TTA CAT ACC 3431 
Asn Glu Leu Phe Ser Leu Glu Ser Leu Asn He Lys Ser Leu His Thr 
2265 2270 2275 2280 

GCC TAT GAA GAG CGT AAT GAT TTT CTA GCG GGC AAT AAT ATC CTT GCT 3479 
Ala Tyr Glu Glu Arg Asn Asp Phe Leu Ala Gly Asn Asn He Leu Ala 
2285 2290 2295 

CAG CTT GGG AAG TAT GAA AAA ACG GAC GGT ACT TTT GGA CAA ATG GGC 3527 
Gin Leu Gly Lys Tyr Glu Lys Thr Asp Gly Thr Phe Gly Gin Met Gly 
2300 2305 2310 

GAT TTA AAT TTC AGT TTT AAC CCG TTT TAT AGC CGA TTT ACC GAA GCG 3575 
Asp Leu Asn Phe Ser Phe Asn Pro Phe Tyr Ser Arg Phe Thr Glu Ala 
2315 2320 2325 

TTA AAT TTA ACC GAG CAA CAA CGT CGC ACA ATT AAT CTA ACC GGC ACC 3623 
Leu Asn Leu Thr Glu Gin Gin Arg Arg Thr He Asn Leu Thr Gly Thr 
2330 2335 2340 

GGT CGG GTT CGG GAT TTG CGT GAA GCC GCC GCA CTT TCT GAG GAG TTG 3671 
Gly Arg Val Arg Asp Leu Arg Glu Ala Ala Ala Leu Ser Glu Glu Leu 
2345 2350 2355 2360 

GCT GCT TTA TTA CAA CAG TAC ACT AAG GGC TCC GAT TTT CAG GCA CAA 3719 
Ala Ala Leu Leu Gin Gin Tyr Thr Lys Gly Ser Asp Phe Gin Ala Gin 
2365 2370 2375 ' 

CGA GAA TTA TTG CCT GCC ATT TTA GAT AAA TGG GCG GCA ACG GAT TTA 3767 
Arg Glu Leu Leu Pro Ala He Leu Asp Lys Trp Ala Ala Thr Asp Leu 
2380 2385 2390 



CAG TAT CAA CAT TAT GAT AAA ACA TTA CTT AAA ACG GTA GAA AGT ACC 
Gin Tyr Gin His Tyr Asp Lys Thr Leu Leu Lys Thr Val Glu Ser Thr 
2395 2400 2405 



3815 



42 



GAT AGT AGT GCT TCT GTC GTT AGA GTC ACG CCT TCT CAA TTA AGT AGT 
Asp Ser Ser Ala Ser Val Val Arg Val Thr Pro Ser Gin Leu Ser Ser 
2410 2415 2420 



3863 



ATA CGC AAT GTA AAG CAT GAT CCT ACC GTT ATG CAA AAC TGT GAA CAA 
lie Arg Asn Val Lys His Asp Pro Thr Val Met Gin Asn Cys Glu Gin 
2425 2430 2435 2440 



3911 



AGT AAG GCA AAA ATT GCG ACT TTA AAT TCG CTC TAC GGG TTA AAT ATT 
Ser Lys Ala Lys lie Ala Thr Leu Asn Ser Leu Tyr Gly Leu Asn lie 
2445 2450 2455 



3959 



GAT CAA CTT TAT TAT ACG ACG GAT AAA GAC ATT CGT TAT ATT ACT GAC 
Asp Gin Leu Tyr Tyr Thr Thr Asp Lys Asp lie Arg Tyr lie Thr Asp 
2460 2465 2470 



4007 



AAA GTG AAT AAT ATG TAT CAA ACA ACC GGA GAA CTC GGC TAT CGT TCT 
Lys Val Asn Asn Met Tyr Gin Thr Thr Gly Glu Leu Gly Tyr Arg Ser 
2475 2480 2485 



4055 



TTA CTT TTA CAA ACG CGT GTG AAG AAA TAT GTT TAT AGC GTT AAT GCG 
Leu I^eu Leu Gin Thr Arg Val Lys Lys Tyr Val Tyr Ser Val Asn Ala 
2490 2495 2500 



4103 



AAA CAA TTC GAA GGG AAA TGG GTA GCC GAT TAT TCT CGT ACT GAA GCC 
Lys Gin Phe Glu Gly Lys Trp Val Ala Asp Tyr Ser Arg Thr Glu Ala 
2505 2510 2515 2520 



4151 



TTA TTT AAC TCT ACT TAT AAA CAA TCG CCC GAA AAT GTA TTA TAT GAT 
Leu Phe Asn Ser Thr Tyr Lys Gin Ser Pro Glu Asn Val Leu Tyr Asp 
2525 2530 2535 



4199 



TTA CGC GAA TAC CTT TCT TTC TAT AAC GAC CCT ACG GAA TGG AAA GAA 
Leu Arg Glu Tyr Leu Ser Phe Tyr Asn Asp Pro Thr Glu Trp Lys Glu 
2540 2545 2550 



4247 



GGG CTA TTA CTG TTA AGC CGT TAT ATA GAT TAT GCT AAA GCA CAA GGA 
Gly Leu Leu Leu heu Ser Arg Tyr He Asp Tyr Ala Lys Ala Gin Gly 
2555 2560 2565 



4295 



TTT TAT GAA AAC TGG GCG GCT ACT TCT AAC TTA ACT ATT GCC CGT TTA 
Phe Tyr Glu Asn Trp Ala Ala Thr Ser Asn Leu Thr He Ala Arg Leu 
2570 2575 2580 



4343 



AGA GAG GCT GGA GTA ATT TGT GCA GAA TCG ACG GAT TTA AAA GGC GAT 
Arg Glu Ala Gly Val He Cys Ala Glu Ser Thr Asp Leu Lys Gly Asp 
2585 2590 2595 2600 



4391 



GAA AAA AAT AAT ATT GTG TTA GGT AGC CAA AAA GAT AAT AAC TTA TCG 
Glu Lys Asn Asn lie Val Leu Gly Ser Gin Lys Asp Asn Asn Leu Ser 
2605 2610 2615 



4439 



GGT AGT GCA GGT GAT GAT CTA CTT ATC GGC GGA GAG GGT AAT GAT ACG 
Gly Ser Ala Gly Asp Asp Leu Leu He Gly Gly Glu Gly Asn Asp Thr 
2620 2625 2630 



4487 



TTA AAA GGC AGC TAC GGT GCA GAC ACC TAT ATC TTT AGC AAA GGG CAT 
Leu Lys Gly Ser Tyr Gly Ala Asp Thr Tyr He Phe Ser Lys Gly His 
2635 2640 2645 



4535 



GGA CAA GAT GTA ATT TAT GAA TAT TCC GAC AGT GCA AAC TCT AAA AAA 
Gly Gin Asp Val He Tyr Glu Tyr Ser Asp Ser Ala Asn Ser Lys Lys 
2650 2655 2660 



4583 



GAT ATT GAT ACC TTA AAA TTT ACC GAT GTG AAT TAT GCG GAA GTG AAG 
Asp He Asp Thr Leu Lys Phe Thr Asp Val Asn Tyr Ala Glu Val Lys 
2665 2670 2675 2660 



4631 
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TTT CGA CGA GTA GAT AAT GAC TTA ATG TTA TTC GGT TAT CAT GAT ACG 4679 
Phe Arg Arg Val Asp Asn Asp Leu Met Leu Phe Gly Tyr His Asp Thr 
2685 2690 2695 

GAT TCG GTC ACG GTA AAA TCC TTC TAC AGC CAT GTA GAT TAT CAA TTT 4727 
Asp Ser Val Thr Val Lys Ser Phe Tyr Ser His Val Asp Tyr Gin Phe 
2700 2705 2710 



GAC AAA TTG GAG TTT GCT GAC CGC AGT ATA ACT CGC GAT GAA CTG ATT 
Asp Lys Leu Glu Phe Ala Asp Arg Ser lie Thr Arg Asp Glu Leu He 
2715 2720 2725 



4775 



AAA GCA GGG CTT CAT CTA TAC GGC ACC GAT GGC AAT GAT GAT ATA AAG 
Lys Ala Gly Leu His Leu Tyr Gly Thr Asp Gly Asn Asp Asp He Lys 
2730 2735 2740 



4823 



GAT CAT GCG GAT TGG GAC AGC ATT GTG GAA GGC GGC AAA GGC AAC GAT 4871 
Asp His Ala Asp Trp Asp Ser He Val Glu Gly Gly Lys Gly Asn Asp 
2745 2750 2755 2760 

ATT CTA AGA GGT GGC TAC GGT GCG GAC ACC TAT ATC TTT AGC AAA GGA 4919 
He Leu Arg Gly Gly Tyr Gly Ala Asp Thr Tyr He Phe Ser Lys Gly 
2765 2770 2775 

CAC GGA CAG GAT ATC GTT TAT GAA GAT ACC AAT AAT GAT AAC CGA GCA 4967 
His Gly Gin Asp He Val Tyr Glu Asp Thr Asn Asn Asp Asn Arg Ala 
2780 2785 2790 

AGA GAT ATC GAC ACC TTA ACA TTT ACT GAT GTG AAT TAT GCG GAA GTG 5015 
Arg Asp He Asp Thr Leu Thr Phe Thr Asp Val Asn Tyr Ala Glu Val 
2795 2800 2805 



AAA TTC COA CGA GTA GAT AAT GAC TTA ATG TTA TTC GGT TAT CAT GAT 
Lys Phe Arg Arg Val Asp Asn Asp Leu Met Leu Phe Gly Tyr His Asp 
2810 2815 2820 



5063 



ACQ GAT TCG GTC ACQ ATA AAA TCC TTC TAC AAC CAT GTA GAT TAT CAA 
Thr Asp Ser Val Thr He Lys Ser Phe Tyr Asn His Val Asp Tyr Gin 
2825 2830 2835 2840 



5111 



TGT GAC AAA TTG GAC TTT GCT GAC CGC AGT ATA ACT CGT GAT GAA CTA 
Cys Asp Lys Leu Asp Phe Ala Asp Arg Ser He Thr Arg Asp Glu Leu 
2845 2850 2855 



5159 



GGT AAA CAA GGT ATG GCA TTA TTT GGC ACT GAC GGC GAT GAT AAT ATC 
Gly Lys Gin Gly Met Ala Leu Phe Gly Thr Asp Gly Asp Asp Asn He 
2860 2865 2870 



5207 



AAC GAC TGG GGA CGT AAC TCG GTG ATT GAT GCC GGT GCG GGT AAT GAT 
Asn Asp Trp Gly Arg Asn Ser Val He Asp Ala Gly Ala Gly Asn Asp 
2875 2880 2885 



5255 



ACG GTT AAT GGC GGT AAT GGC GAT GAC ACC CTC ATC GGC GGC AAA GGT 
Thr Val Asn Gly Gly Asn Gly Asp Asp Thr Leu He Gly Gly Lys Gly 
2890 2895 2900 



5303 



AAT GAT ATT CTA AGA GGT GGC TAC GGT GCG GAC ACC TAT ATC TTT AGC 
Asn Asp He Leu Arg Gly Gly Tyr Gly Ala Asp Thr Tyr He Phe Ser 
2905 2910 2915 2920 



5351 



PiAA GGA CAC GGA CAG GAT ATC GTT TAT GAA GAT ACC AAT AAT GAT AAC 
Lys Gly His Gly Gin Asp He Val Tyr Glu Asp Thr Asn Asn Asp Asn 
2925 2930 2935 



5399 



CGC GCA AGA GAT ATC GAC ACC TTA AAA TTT ACT GAT ATT AAT TTA TCC 
Arg Ala Arg Asp He Asp Thr Leu Lys Phe Thr Asp He Asn Leu Ser 
2940 2945 2950 



5447 



44 



GAA CTT TGG TTT AGC CGA GAA AAT AAC GAT TTG ATT ATT AAA TCA TTA 
Glu Leu Trp Phe Ser Arg Glu Asn Asn Asp Leu He He Lys Ser Leu 
2955 2960 2965 

TTA AGT GAG GAT AAA GTC ACG GTT CAA AAT TGG TAT TCA CAC CAA GAT 
"^^^ f ^^^^^^ Thr Val Gin Asn Trp Tyr Ser His Gin aJd 

2970 2975 2980 

CAT AAA ATA GAA AAT ATT CGT TTA TCG AAT GAG CAA ATG TTG GTG AGC 
naL^^^ ^^"^ Ser Asn Glu Gin Met Leu Val Ser 

2985 2990 2995 3000 

ACT CAG GTG GAG AAG ATG GTT GAG TCG ATG GCC GGC TTT GCT CAG AAr 
Thr Gin Val Glu Lys Met Val Glu Ser Met Ala Gly Phe Ala Gin Lvs 
3005 3010 3015 

CAC GGA GGA GAG ATA TCT CTT CTG TCG CCT GAA GAG GTA AAA CAA TAT 
His Gly Gly Glu He Ser Leu Leu Ser Pro Glu Glu Val Lys Gin Tw 
3020 3025 3030 

ATC AAT AGC TTA ACA GCT GCT TTA TAA CATACQAAAG AAATCGGCAC 
He Asn Ser Leu Thr Ala Ala Leu ♦ 
3035 3040 

AGrrrTTGTG AACTGTGCCG ATTTGATTTT AGTGTAAGAA TATAGCCTGA TTTTAAGAAA 
TTTACTCTTG GCTAATAACT ATTTCCCATT TTATAAGTTA TTOACGGATC GTTTTATCAA 
ATATGAGATC AAATCTTATT TTAAATTCGC TTTCCATTAA GCQATATTGA TCTTTTAAGT 
TTOGGGCCGC ATGAGTTTGG AACCQATACC ACTCATTOTG GGAATCAATA CACAATACGC 
TOTAATCGOA CTCTTGCAGT TCATAATAAT GCTTTCTCTC COTTAATTCT TCTTGCGTAT 
ATGGCGAGAG ATTAAAGCTG AATGGCTGGT TCGCACTAAC AAACAGGTTC TCCGATTTCA 
GATATTCACA ACCGTAATGG CTACCGGTTT CCTGCGGTTT TACATAATTG GTATGATTTT 
GTTTAGCTGT TATACGGTAO ATGCCTAATT GTGGTAAATT GCGTGTGTCA ATATAGCTTT 
CTTGTTCTCC GTAACCGAAA TACTCAATGG CGTTTTCTGT TTTAGCTAAG AAGAAACGTA 
AGCCGAAGCG GGGTAAATAC GGTAATTCGA TCGGGCGAAT AGCGTTAATT TCAACCGAAA 
GTTGTCCGTC ATTGAAGATA CGATAACGAA TATCCAGTGT TAAAATGCGA CCGCGAGAAA 
TTGACACAAT TGCAGATTTT ACTGAAAATT CGACCGCTTG TTCGCTTTGC TGCCACTGAA 
TTTCATACGC TCTGGTATAG GCTTTATCGT AGCCGGCATT TTGGCACGCC TCACGAATGA 
GGCGATCATT GTCGGTTGGC GCACGCCAAA TATTAAAATC TAACGATTGT TGGATAATCG 
CTTTACCGGC TTTTTCAATA CGGGTGAAAA TCCCTTTCTG TTTATCTAAT TGATAACTAA 
ATTGACCGTT GTGTACGTTA ATGTGGAAGC GATCTTCTTG TACTTCAAAT GCACTGTTCT 
CAATTGTGAA TTGTGGTAAT ACTAATTTAT TTTCGCTAAA TAAATTGAGC TGCTCGAAGC 
CAAGTGAATG TGCTTCGTCT AATAATTCGA CCGCGGTATT TAAGCGATAA TTTAAATTCA 
GTAGCCATAA ATGCCCGTTA TTTTTTGGTA ACTCAATCGG TAATACTACG CTGCCGTGCG 
GTTGGCAAGA AACGGATAAA TTCCCACCGC TTGTCACCAC GCCGTTTTCG ACAAATTCGT 
AATCAATCGT TAAATAATCG GCAAGATCAG TGAAATCCAA GTAGTTGTGG ATCACAATTT 
GGTTATCQAT 



5495 

5543 

5591 

5639 

5687 

5734 

5794 

5854 

5914 

5974 

6034 

6094 

6154 

6214 

6274 

6334 

6394 

6454 

6514 

6574 

6634 

6694 

6754 

6814 

6874 

6934 

6994 

7004 
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(2) INFORMATION FOR SEQ ID NO: 4: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1383 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

Met Ser Asp Asn Ala Phe Phe Val lie Glu Glu Ser Gly Lys Arg Tyr 
15 10 15 

lie Glu Asn Phe Gly lie Glu Pro Leu Gly Lys Gin Glu Asp Phe Asp 
20 25 30 

Phe Val Gly Gly Phe Trp Ser Asn Leu Val Asn Arg Gly Leu Glu Ser 
35 40 45 

lie lie Asp Pro Ser Gly lie Gly Gly Thr Val Asn Leu Asn Phe Thr 
50 55 60 

Gly Glu Val Glu Thr lyr Thr Leu Asp Glu Thr Arg Phe Lys Ala Glu 
65 70 75 80 

Ala Ala Lys Lys Ser His Trp Ser Leu Val Asn Ala Ala Lys Val Tyr 
85 90 95 

Gly Gly Leu Asp Gin lie lie Lys Lys Leu Trp Asp Ser Gly Ser lie 
100 105 110 

Lys His Leu Tyr Gin Asp Lys Asp Thr Gly Lys Leu Lys Pro lie lie 
115 120 125 

Tyr Gly Thr Ala Gly Asn Asp Ser Lys lie Glu Gly Thr Lys lie Thr 
130 135 140 

Arg Arg lie Ala Gly Lys Glu Val Thr Leu Asp lie Ala Asn Gin Lys 
145 150 155 160 

lie Glu Lys Gly Val Leu Glu Lys Leu Gly Leu Ser Val Ser Gly Ser 
165 170 175 

Asp lie lie Lys Leu Leu Phe Gly Ala Leu Thr Pro Thr Leu Asn Arg 

ISO 185 190 

Met Leu Leu Ser Gin Leu lie Gin Ser Phe Ser Asp Ser Leu Ala Lys 
195 200 205 

Leu Asp Asn Pro Leu Ala Pro Tyr Thr Lys Asn Gly Val Val Tyr Val 
210 215 220 

Thr Gly Lys Gly Asn Asp Val Leu Lys Gly Thr Glu His Glu Asp Leu 
225 230 235 240 

Phe Leu Gly Gly Glu Gly Asn Asp Thr Tyr Tyr Ala Arg Val Gly Asp 
245 250 255 

Thr lie Glu Asp Ala Asp Gly Lys Gly Lys Val Tyr Phe Val Arg Glu 
260 265 270 

Lys Gly Val Pro Lys Ala Asp Pro Lys Arg Val Glu Phe Ser Glu Tyr 
275 280 285 

lie Thr Lys Glu Glu He Lys Glu Val Glu Lys Gly Leu Leu Thr Tyr 
290 295 300 

Ala Val Leu Glu Asn Tyr Asn Trp Glu Glu Lys Thr Ala Thr Phe Ala 
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His Ala Thr Met Leu Asn Glu Leu Phe Thr Asp Tyr Thr Asn Tyr Arg 
325 330 335 

Tyr Glu Val Lys Gly Leu Lys Leu Pro Ala Val Lys Lys Leu Lys Ser 
340 345 350 

Pro Leu Val Glu Phe Thr Ala Asp Leu Leu Thr Val Thr Pro lie Asp 

355 360 365 

Glu Asn Gly Lys Ala Leu Ser Glu Lys Ser lie Thr Val Lys Asn Phe 
370 375 380 

Lys Asn Gly Asp Leu Gly lie Arg Leu Leu Asp Pro Asn Ser Tyr Tyr 
385 390 395 400 

Tyr Phe Leu Glu Gly Gin Asp Thr Gly Phe Tyr Gly Pro Ala Phe Tyr 
405 410 415 

lie Glu Arg Lys Asn Gly Gly Gly Ser Lys Asn Asn Ser Ser Gly Ala 
420 425 430 

Gly Asn Ser Lys Asp Trp Gly Gly Asn Gly His Gly Asn His Arg Asn 
435 440 445 

Asn Ala Ser Asp Leu Asn Lys Pro Asp Gly Asn Asn Gly Asn Asn Gin 
45t) 455 460 

Asn Asn Gly Ser Asn Gin Asp Asn His Ser Asp Val Asn Ala Pro Asn 
465 470 475 480 

Asn Pro Gly Arg Asn Tyr Asp lie Tyr Asp Pro Leu Ala Leu Asp Leu 
485 490 495 

Asp Gly Asp Gly Leu Glu Thr Val Ser Met Asn Gly Arg Gin Gly Ala 
500 505 510 

Leu Phe Asp His Glu Gly Lys Gly lie Arg Thr Ala Thr Gly Trp Leu 

515 520 525 

Ala Ala Asp Asp Gly Phe Leu Val Leu Asp Arg Asn Gin Asp Gly lie 
530 535 540 

lie Asn Asp lie Ser Glu Leu Phe Ser Asn Lys Asn Gin Leu Ser Asp 
545 550 555 560 

Gly Ser lie Ser Ala His Gly Phe Ala Thr Leu Ala Asp Leu Asp Thr 
565 570 575 

Asn Gin Asp Gin Arg lie Asp Gin Asn Asp Lys Leu Phe Ser Lys Leu 
580 585 590 

Gin lie Trp Arg Asp Leu Asn Gin Asn Gly Phe Ser Glu Ala Asn Glu 
595 600 605 

Leu Phe Ser Leu Glu Ser Leu Asn He Lys Ser Leu His Thr Ala Tyr 
610 615 620 

Glu Glu Arg Asn Asp Phe Leu Ala Gly Asn Asn He Leu Ala Gin Leu 
625 630 635 640 

Gly Lys Tyr Glu Lys Thr Asp Gly Thr Phe Gly Gin Met Gly Asp Leu 
645 650 655 

Asn Phe Ser Phe Asn Pro Phe Tyr Ser Arg Phe Thr Glu Ala Leu Asn 
660 665 670 

Leu Thr Glu Gin Gin Arg Arg Thr He Asn Leu Thr Gly Thr Gly Arg 
675 680 685 



47 



Val Arg Asp Leu Arg Glu Ala Ala Ala Leii Ser Glu Glu Leu Ala Ala 
690 695 700 

Leu Leu Gin Gin Tyr Thr Lys Gly Ser Asp Phe Gin Ala Gin Arg Glu 
705 710 715 720 

Leu Leu Pro Ala lie Leu Asp Lys Trp Ala Ala Thr Asp Leu Gin Tyr 
725 730 735 

Gin His Tyr Asp Lys Thr Leu Leu Lys Thr Val Glu Ser Thr Asp Ser 
740 745 750 

Ser Ala Ser Val Val Arg Val Thr Pro Ser Gin Leu Ser Ser lie Arg 
755 760 765 

Asn Val Lys His Asp Pro Thr Val Met Gin Asn Cys Glu Gin Ser Lys 
770 775 780 

Ala Lys lie Ala Thr Leu Asn Ser Leu Tyr Gly Leu Asn lie Asp Gin 
785 790 795 800 

Leu Tyr Tyr Thr Thr Asp Lys Asp lie Arg Tyr lie Thr Asp Lys Val 
805 810 815 

Asn Asn Met Tyr Gin Thr Thr Gly Glu Leu Gly Tyr Arg Ser Leu Leu 
820 825 830 

Leu Gin Thr Arg Val Lys Lys Tyr Val Tyr Ser Val Asn Ala Lys Gin 
835 840 845 

Phe Glu Gly Lys Trp Val Ala Asp Tyr Ser Arg Thr Glu Ala Leu Phe 
850 855 860 

Asn Ser Thr Tyr Lys Gin Ser Pro Glu Asn Val Leu Tyr Asp Leu Arg 
865 870 875 880 

Glu Tyr Leu Ser Phe Tyr Asn Asp Pro Thr Glu Trp Lys Glu Gly Leu 
885 890 895 

Leu Leu Leu Ser Arg Tyr He Asp Tyr Ala Lys Ala Gin Gly Phe Tyr 
900 905 910 

Glu Asn Trp Ala Ala Thr Ser Asn Leu Thr He Ala Arg Leu Arg Glu 
915 920 925 

Ala Gly Val He Cys Ala Glu Ser Thr Asp Leu Lys Gly Asp Glu Lys 
930 935 940 

Asn Asn He Val Leu Gly Ser Gin Lys Asp Asn Asn Leu Ser Gly Ser 
945 950 955 960 

Ala Gly Asp Asp Leu Leu He Gly Gly Glu Gly Asn Asp Thr Leu Lys 
965 970 975 

Gly Ser Tyr Gly Ala Asp Thr Tyr He Phe Ser Lys Gly His Gly Gin 
980 985 990 

Asp Val He Tyr Glu Tyr Ser Asp Ser Ala Asn Ser Lys Lys Asp lie 
995 1000 1005 

Asp Thr Leu Lys Phe Thr Asp Val Asn Tyr Ala Glu Val Lys Phe Arg 
1010 1015 1020 

Arq Val Asp Asn Asp Leu Met Leu Phe Gly Tyr His Asp Thr Asp Ser 
1025 1030 1035 1040 

Val Thr Val Lys Ser Phe Tyr Ser His Val Asp Tyr Gin Phe Asp Lys 
1045 1050 1055 



48 



Leu Glu Phe Ala Asp Arg Ser lie Thr Arg Asp Glu Leu lie Lys Ala 
1060 1065 1070 

Gly Leu His Leu Tyr Gly Thr Asp Gly Asn Asp Asp lie Lys Asp His 
1075 1080 1085 

Ala Asp Trp Asp Ser lie Val Glu Gly Gly Lys Gly Asn Asp lie Leu 
1090 1095 1100 

Arg Gly Gly Tyr Gly Ala Asp Thr Tyr He Phe Ser Lys Gly His Gly 
1105 1110 1115 1120 

Gin Asp He Val Tyr Glu Asp Thr Asn Asn Asp Asn Arg Ala Arg Asp 
1125 1130 1135 

He Asp Thr Leu Thr Phe Thr Asp Val Asn Tyr Ala Glu Val Lys Phe 
1140 1145 1150 

Arg Arg Val Asp Asn Asp Leu Met Leu Phe Gly Tyr His Asp Thr Asp 
1155 1160 1165 

Ser Val Thr He Lys Ser Phe Tyr Asn His Val Asp Tyr Gin Cys Asp 
1170 1175 1180 

Lys Leu Asp Phe Ala Asp Arg Ser He Thr Arg Asp Glu Leu Gly Lys 
1185 -; 1190 1195 1200 

Gin Gly Met Ala Leu Phe Gly Thr Asp Gly Asp Asp Asn He Asn Asp 
1205 1210 1215 

Trp Gly Arg Asn Ser Val He Asp Ala Gly Ala Gly Asn Asp Thr Val 
1220 1225 1230 

Asn Gly Gly Asn Gly Asp Asp Thr Leu He Gly Gly Lys Gly Asn Asp 
1235 1240 1245 

He Leu Arg Gly Gly Tyr Gly Ala Asp Thr Tyr He Phe Ser Lys Gly 
1250 1255 1260 

His Gly Gin Asp He Val Tyr Glu Asp Thr Asn Asn Asp Asn Arg Ala 
1265 1270 1275 1280 

Arg Asp He Asp Thr Leu Lys Phe Thr Asp He Asn Leu Ser Glu Leu 
1285 1290 1295 

Trp Phe Ser Arg Glu Asn Asn Asp Leu He He Lys Ser Leu Leu Ser 
1300 1305 1310 

Glu Asp Lys Val Thr Val Gin Asn Trp Tyr Ser His Gin Asp His Lys 
1315 1320 1325 

He Glu Asn He Arg Leu Ser Asn Glu Gin Met Leu Val Ser Thr Gin 
1330 1335 1340 

Val Glu Lys Met Val Glu Ser Met Ala Gly Phe Ala Gin Lys His Gly 
1345 1350 1355 1360 

Gly Glu He Ser Leu Leu Ser Pro Glu Glu Val Lys Gin Tyr He Asn 
1365 1370 1375 - 

Ser Leu Thr Ala Ala Leu * 
1380 



n 
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(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6736 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: UNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Actinobacillus pleuropneumoniae 

(B) STRAIN: 4074 (serotype 1 reference strain) 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: pROK? 



(ix) FEATURE: 

(A) NAME/KEY_: CDS 

(B) liOC&TION: 1132. .654 9 

(C) IDENTIFICATION METHOD: experimental 

(D) OTHER INFORMATION: /codon Starts 1132 

/functions "RTX toxin*^ 
/products "J^xIV 
/evidence^ EXPERIMENTAL 
/genes «ApxIV_vl" 

(ix). FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 639, .1178 

(D) OTHER INFORMATION : /codon Starts 639 
/functions "unknown" " 
/products "ORFl" 
/genes "orfl" 



(B) 
(D) 



(ix) FEATURE: 

(A) NAME/KEY: CDS 
LOCATION:!. .453 
OTHER INFORMATION: /partial 
/products "Met-G" 
/genes "mrp" 
/standard_names "mrp" 
/labels mrp 

(ix) FEATURE: 

(A) NAME/KEY: -10 signal 

(B) LOCATION: 617. T623 

(D) OTHER INFORMATION: /standard narae= "-10" 
/labels -10_s 

(ix) FEATURE: 

(A) NAME/KEY: -35 signal 

(B) U)CATION: 594. 7599 

(D) OTHER INFORMATION: /standard names "-35 s" 
/labels -35_s - 

(ix) FEATURE: 

(A) NAME/KEY: promoter 

(B) LOCATION: 454. .1131 

(D) OTHER INFORMATION :/fvinction= "Promoter" 
/standard_name= "promoter ApxIV" 
/labels promoter 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

ATC GAT ATG CCG CCG GGT ACG GGC GAT ATC CAA CTT ACT CTT TCG CAA - 
He Asp Met Pro Pro Gly Thr Gly Asp He Gin Leu Thr Leu Ser Gin • 
15 10 15 

CAA ATT CCG GTT ACC GGT GCG GTG GTG GTA ACC ACT CCG CAA GAT ATT Qfi 
Gin He Pro Val Thr Gly Ala Val Val Val Thr Thr Pro Gin Asp lie 
20 25 30 

GCG TTA TTA GAT GCG GTG AAA GGT ATT TCA ATG TTC CAA AAA GTG TCG 144 
Ala Leu Leu Asp Ala Val Lys Gly He Ser Met Phe Gin Lys Val Ser 
35 40 45 

HT? £^ 9^ 7^^ 29'^ ^T^ ^T^ AAT ATG AGC GTA cat ATT nnn 
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AAT TGC GGT CAC CAC GAA GAT ATT TTC GGC ACC GGC GOT GCG GAG AAA 240 
Asn Cys Gly His His Glu Asp lie Phe Gly Thr Gly Gly Ala Glu Lys 
65 70 75 80 

GTG GCG AAG AAA TAG GGT ACT AAA GTA TTA GGA CAA ATG CCG TTG CAT 288 
Val Ala Lys Lys Tyr Gly Thr Lys Val Leu Gly Gin Met Pro Leu His 
85 90 95 

ATT CGC TTA CGT CAA GAT TTG GAT GCC GGC ACA CCG ACC GTC GTT GCG 336 
lie Arg Leu Arg Gin Asp Leu Asp Ala Gly Thr Pro Thr Val Val Ala 
100 105 110 

GCA CCG GAA CAC GAA ACC AGC CGA GCC TAT ATT GAA TTA GCG GCA AAA 384 
Ala Pro Glu His Glu Thr Ser Arg Ala Tyr lie Glu Leu Ala Ala Lys 
115 120 125 

GTC GCT TCG GAA TTA TAC TGG CAA GGT TCG GTT ATC CCG TCT GAA ATT 432 
Val Ala Ser Glu Leu Tyr Trp Gin Gly Ser Val lie Pro Ser Glu lie 
130 135 140 

ATG ATT CGT GAA GTA AAA TAA GTTTTAATAA CCACGAAAAC ACAAAGAACA 483 
Met lie Arg Glu Val Lya ♦ 
145 150 

CAAGCGGTAG AATTTGCAGA AAAATTTGCA AATCCTACCG CTTTTTTATT AGTACGATTC 543 

GCTGTTGGAC TGCTATTTGA TTTGGTTTGT CAGGATATTA TGTTATTGTA ATGAAATGTT 603 

AOTGAATTAT TTTTATTAAT TTGAAAGGAA ACAAAATGAA AATAAAAAAA CGTTACATTG 663 

CGCTGTTGGT CTTAGGTGTC GTTATCAGCT ATGCCTGGTA TCAAAATTAT CAATGGGTUVC 723 

AGCTGATGTT AAGCGGTTAT TGTGAAAAGG ACGGAAGTTA TTTTGATGAT A6GCATACGA 783 

AGCAAQAACT GATTGATAGG GCAATTAACT ATATGCTGGA GCATCAATCT AAAAAAACAT 843 

ACGATGCTTA TACTGATGAA CCTTTAGAAA TAAAACCATA TTTAACAATA GAGGAATTTA 903 

AAAAACTCAA TCCAAATTGT TGTGAAATTA CCTCATGGCC A6CAGATGCA GTTCCACAAG 963 

ATTGGGATGT TCGTGTGGAA GGTAAGGCAT ATAGGTATGT AATC6TAAAA TATTTAAQAA 1023 

CCTTAGCAAA TAGAGAACCT GAACGATGGG AAACTAGTAT TGTTTTTGAT AATTGCGGCA 1083 

ATCCTAAAAG AGCAAGCTAC TTATATTATT TAAAGAGAGA AATTTATT ATG ACA AAA 1140 

Met Thr Lys 

TTA ACT ATG CAA GAT GTG ACC AAT TTA TAT TTA TAT AAA ACG AAA ACT 1188 
Leu Thr Met Gin Asp Val Thr Asn Leu Tyr Leu Tyr Lys Thr Lys Thr 
5 10 15 

CTA CCT AAA GAT AGA TTG GAT GAT TCA CTT ATT TCT GAA ATA GGA AAA 1236 
Leu Pro Lys Asp Arg Leu Asp Asp Ser Leu lie Ser Glu He Gly Lys 
20 25 30 35 

GGA GAT GAT GAT ATT GAT AGA AAA GAA TTT ATG GTG GGG CCG GGA CGT 1284 
Gly Asp Asp Asp lie Asp Arg Lys Glu Phe Met Val Gly Pro Gly Aror 
40 45 50 

TTT GTG ACC GCT GAT AAC TTT AGC GTT GTA AGA GAT TTT TTT AAT GCT 1332 
Phe Val Thr Ala Asp Asn Phe Ser Val Val Arg Asp Phe Phe Asn Ala 
55 60 65 

GGG AAA TCA CGC ATT ATT GCG CCG CAA GTC CCG CCT ATT CGT TCA CAG 1380 
Gly Lys Ser Arg He He Ala Pro Gin Val Pro Pro He Arg Ser Gin 
70 75 80 

CAG GAA AAA ATC TTG GTC GGT TTA AAA CCG GGC AAA TAT TCC AAA GCG 1428 
Gin Glu Lys He Leu Val Gly Leu Lys Pro Gly Lys TVr Ser Lys Ala 
85 90 95 

CAG ATA TTG GAA ATG CTG GGT TAT ACG AAA GGC GGA GAA GTG GTA AAT ' 1476 
Gin He Leu Glu Met Leu Gly Tyr Thr Lys Gly Gly Glu Val Val Asn 
100 105 110 lis 

GGC ATG TTT GCC GGT GAA GTC CAG ACA TTA GGC TTT TAT GAC GAT GGC 1524 
Gly Met Phe Ala Gly Glu Val Gin Thr Leu Gly Phe Tyr Asp Asp Gly 
120 125 130 

AAA GGG GAT TTA CTC GAA. CGC GCC TAT ATC TGG AAT ACC ACA GGA TTT 1572 
Lys Gly Asp Leu Leu Glu Arg Ala Tyr He Trp Asn Thr Thr Gly Phe 
135 140 145 
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Lys Met Ser Asp Aan Ala Phe Phe Val lie Qlu Glu Ser Gly Lys Arg 

150 155 160 

TAT ATT GAA AAC TTT GGT ATT GAA CCT CTT GGT AAG CAA GAA GAT TTT 
Tyr lie Glu Asn Phe Gly lie Glu Pro Leu Gly Lys Gin Glu Asp Phe 
165 170 175 

GAT TTT GTC GGC GGC TTT TGG TCT AAC TTA GTG AAT CGT GGT TTG GAA 
Asp Phe Val Gly Gly Phe Trp Ser Asn Leu Val Asn Arg Gly Leu Glu 
180 185 190 195 

AGT ATT ATC GAC CCA TCC GGT ATC GGT GGA ACG GTA AAC CTT AAC TTT 
Ser lie lie Asp Pro Ser Gly lie Gly Gly Thr Val Asn Leu Asn Phe 
200 205 210 

ACC GGC GAG GTG GAA ACC TAC ACG TTA GAC GAA ACA AGG TTT AAA GCG 
Thr Gly Glu Val Glu Thr Tyr Thr Leu Asp Glu Thr Arg Phe Lys Ala 
215 220 225 

GAA GCG GCG AAG AAA AGC CAT TGG AGT TTA GTG AAT GCG GCG AAA GTA 
Glu Ala Ala Lys Lys Ser His Trp Ser Leu Val Asn Ala Ala Lys Val 

230 235 240 



TAC GGC GGT TTA GAC CAA ATT ATT AAA AAA CTA TGG GAC AGT GGC TCA 
Tyr Gly Gly Leu Asp Gin lie lie Lys Lys Leu Trp Asp Ser Gly Ser 
24S 250 255 



ATT AAG CAT TTA TAT CAA GAT AAA GAT ACG GGC AAA TTA AAA CCG ATT 
lie Lys His Leu Tyr Gin Asp Lys Asp Thr Gly Lys Leu Lys Pro He 
260 265 270 275 

ATT TAC-JGGC ACG GCC GGC AAC GAC AGT AAG ATT GAA GGC ACT AAA ATC 
He Tyr Gly Thr Ala Gly Asn Asp Ser Lys He Glu Gly Thr Lys He 
280 285 290 

ACC CGT AGG ATT GCG GGT AAA GAA GTT ACG CTT GAT ATT GCC AAT CAG 
Thr Arg Arg He Ala Gly Lys Glu Val Thr Leu Asp He Ala Asn Gin 
295 300 305 

AAA ATT GAA AAA GGC GTG TTA GAG AAA TTG GGG CTG TCT GTT AGT GGT 
Lys He Glu Lys Gly Val Leu Glu Lye Leu Gly Leu Ser Val Ser Gly 
310 315 320 

TCG GAT ATC ATT AAA TTG TTG TTT GGA GCA TTG ACT CCA ACT TTA AAT 
Ser Asp He He Lys Leu Leu Phe Gly Ala Leu Thr Pro Thr Leu Asn 
325 330 335 

AGA ATG TTG CTA TCA CAA CTT ATC CAG TCT TTT TCC GAT AGC TTG GCT 
Arg Met Leu Leu Ser Gin Leu He Gin Ser Phe Ser Asp Ser Leu Ala 
340 345 350 355 

AAA CTT GAT AAT CCC TTA GCC CCT TAC ACT AAA AAT GGC GTG GTT TAT 
Lys Leu Asp Asn Pro Leu Ala Pro Tyr Thr Lys Asn Gly Val Val Tyr 
360 365 370 

GTC ACC GGC AAA GGG AAT GAT GTG CTT AAA GGA ACT GAA CAT GAG GAT 
Val Thr Gly Lys Gly Asn Asp Val Leu Lys Gly Thr Glu His Glu Asp 
375 380 385 

TTG TTT CTC GGT GGT GAG GGG AAT GAT ACT TAT TAT GCG AGA GTA GGC 
Leu Phe Leu Gly Gly Glu Gly Asn Asp Thr Tyr Tyr Ala Arg Val Gly 
390 395 400 

GAT ACA ATT GAA GAC GCC GAC GGC AAA GGT AAA GTC TAT TTT GTG AGA 
Asp Thr He Glu Asp Ala Asp Gly Lys Gly Lys Val Tyr Phe Val Arg 
405 410 415 

GAA AAA GGG GTA CCT AAG GCG GAT CCT AAG CGG GTA GAG TTT AGC GAG 
Glu Lys Gly Val Pro Lys Ala Asp Pro Lys Arg Val Glu Phe Ser Glu 
420 425 430 435 

TAC ATA ACG AAA GAA GAA ATA AAA GAG GTT GAA AAG GGG TTA TTA ACT 
Tyr He Thr Lys Glu Glu He Lys Glu Val Glu Lys Gly Leu Leu Thr 
440 445 450 

TAC GCA GTT TTA GAA AAT TAT AAT TGG GAA GAG AAA ACG GCG ACT TTC 
Tyr Ala Val Leu Glu Asn Tyr Asn Trp Glu Glu Lys Thr Ala Thr Phe 
455 460 465 

GCT CAT GCG ACT ATG CTT AAT GAG CTT TTT ACT GAT TAT ACT AAT TAT 
Ala His Ala Thr Met Leu Asn Glu Leu Phe Thr Asp Tyr Thr Asn Tyr 
470 475 480 
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485 490 495 

AGT CCG TTG GTG GAG TTT ACA GCT GAT TTA TTA ACT GTT ACQ CCT ATT 2676 
Ser Pro Leu Val Glu Phe Thr Ala Asp Leu Leu Thr Val Thr Pro lie 
500 505 510 515 

GAC GAA 7VAC GGA AAA GCA CTT AGC GAA AAA AGT ATT ACG GTT AAA AAT 2724 
Asp Glu Asn Gly Lys Ala Leu Ser Glu Lys Ser lie Thr Val Lys Asn 
520 525 530 

TTT AAA AAT GGT GAT TTA GGA ATA AGG TTG TTG GAT CCT AAT AGC TAT 2772 
Phe Lys Asn Gly Asp Leu Gly He Arg Leu Leu Asp Pro Asn Ser Tyr 

535 540 545 

TAT TAT TTC CTT GAA GGC CAA GAT ACG GGT TTT TAT GGT CCT GCT TTT 2820 
Tyr Tyr Phe Leu Glu Gly Gin Asp Thr Gly Phe Tyr Gly Pro Ala Phe 
550 555 560 

TAT ATT GAA CGA AAA AAC GGT GGC GGC GCT AAA AAT AAC TCG TCQ GGA 2868 
Tyr He Glu Arg Lys Asn Gly Gly Gly Ala Lys Asn Asn Ser Ser Gly 
565 570 575 

GCA GGA AAT AGC AAA GAT TGG GGC GGG AAC GGG CAT GGA AAT CAC CGA 2516 
Ala Gly Asn Ser Lys Asp Trp Gly Gly Asn Gly His Gly Asn His Arg 
580 585 590 595 

AAT AAT GCC TCC GAC CTG AAT AAA CCG GAC GGA AAT AAT GGG AAT AAC 2964 
Asn Asn Ala Ser Asp Leu Asn Lys Pro Asp Gly Asn Asn Gly Asn Asn 
600 605 610 

CAA AAT AAC GGA AGC AAT CAA GAT AAT CAT AGC GAT GTG AAT GCG CCA 3012 
Gin Asn Asn Gly Ser Asn Gin Asp Asn His Ser Asp Val Asn Ala Pro 

615 620 625 

AAT AAC CCG GGA CGT AAC TAT GAT ATT TAG GAT CCT TTA GCT TTA GAT 3060 
Asn Asn Pro Oly Arg Asn Tyr Asp lie Tyr Asp Pro Leu Ala Leu Asp 
630 635 640 

TTA GAT GGA GAT GGG CTT GAA ACC GTG TCG ATG AAC GGG CGA CAA GGC 3108 
lieu Asp Gly Asp Gly Leu Glu Thr Val Ser Met Asn Gly Arg Gin Gly 
645 650 655 

GCG TTA TTC GAT CAT GAA GGA AAA GGT ATT CGT ACC GCA ACG GGC TGG 3156 
Ala Leu Phe T^p His Glu Gly Lys Gly He Arg Thr Ala Thr Gly Trp 
660 665 670 675 

CTC GCT GCG GAT GAC GGT TTT TTA GTG TTA GAT CGT AAC CAA GAC GGC 3204 
Leu Ala Ala Asp Asp Gly Phe Leu Val Leu Asp Arg Asn Gin Asp Gly 
680 685 690 

ATT ATT AAT GAT ATA AGC GAG TTA TTT AGT AAT AAA AAT CAA CTT TCC 3252 
He He Asn Asp He Ser Glu Leu Phe Ser Asn Lys Asn Gin Leu Ser 
695 700 705 

GAC GGC AGT ATT TCT GCA CAC GGT TTT GCG ACA TTA GCC GAT TTG GAT 3300 
Asp Gly Ser He Ser Ala His Gly Phe Ala Thr Leu Ala Asp Leu Asp 
710 715 720 

ACA AAC CAA GAT CAG CGT ATC GAC CAA AAT GAT AAG CTG TTT TCT AAA 3348 
Thr Asn Gin Asp Gin Arg He Asp Gin Asn Asp Lys Leu Phe Ser Lys 
725 730 735 

CTC CAA ATT TGG CGG GAT TTA AAT CAA AAC GGT TTT AGT GAA GCG AAT 3396 
Leu Gin He Trp Arg Asp Leu Asn Gin Asn Gly Phe Ser Glu Ala Asn 
740 745 750 755 

GAG CTG TTT AGC TTA GAA AGT TTG AAT ATT AAA TCT TTA CAT ACC GCC 3444 
Glu Leu Phe Ser Leu Glu Ser Leu Asn He Lys Ser Leu His Thr Ala 
760 765 770 

TAT GAA GAG CGT AAT GAT TTT CTA GCG GGC AAT AAT ATC CTT GCT CAG - 3492 
Tyr Glu Glu Arg Asn Asp Phe Leu Ala Gly Asn Asn He Leu Ala Gin . 

775 780 785 



CTT GGG AAG TAT GAA AAA ACG GAC GGT ACT TTT GCA CAA ATG GGC GAT 3540 
Leu Gly Lys Tyr Glu Lys Thr Asp Gly Thr Phe Ala Gin Met Gly Asp 
790 795 800 



TTA AAT TTC AGT TTT AAC CCG TTT. TAT AGC CGA TTT ACC GAA GCG TTA 3588 

Leu Asn Phe Ser Phe Asn Pro Phe Tyr Ser Ajcg Phe Thr Glu Ala Leu 

805 810 815 

nn'P •nmn tK r*r* r» j* r« r^is /^ti * r^r^m r^r^r^ «rrwn •niim ^•«m« « «««««« - — — - -• 
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CGG GTT CGG GAT TTG CX3T GAA GCC GCC GCA CTT TCT GAG GAG TTG GCT 3684 
Are Val Arq Asp Leu Arg Glu Ala Ala Ala Leu Ser Glu Glu Leu Ala 
^ 840 845 850 

GCT TTA TTA CAA CAG TAC ACT AAG GCC TCC GAT TTT CAG GCA CAA CGA 3732 
Ala Leu Leu Gin Gin Tyr Thr Lys Ala Ser Asp Phe Gin Ala Gin Arg 
855 860 865 

GAA TTA TTG CCT GCC ATT TTA GAT AAA TGG GCG GCA ACG GAT TTA CAG 3780 
Glu Leu Leu Pro Ala lie Leu Asp Lys Trp Ala Ala Thr Asp Leu Gin 
870 875 880 

TAT CAA CAT TAT GAT AAA ACA TTA CTT AAA ACG GTA GAA AGT ACC GAT 3828 
Tvx Gin His Tyr Asp Lys Thr Leu Leu Lys Thr Val Glu Ser Thr Asp 
^ 885 890 895 

AGT AGT GCT TCT GTC GTT AGA GTC ACG CCT TCT CAA TTA AGT AGT AtA 3876 
Ser Ser Ala Ser Val Val Arg Val Thr Pro Ser Gin Leu Ser Ser lie 
900 905 910 915 

CGC AAT GCA AAG CAT GAT CCT ACC GTT ATG CAA AAC TTT GAA CAG AGT 3924 
Ara Asn Ala Lys His Asp Pro Thr Val Met Gin Asn Phe Glu Gin Ser 
^ 920 925 930 

AAG GCA AAA ATT GCG ACT TTA AAT TCG CTC TAC GGG TTA AAT ATC GAT 3972 
Lvs Ala Lys lie Ala Thr Leu Asn Ser Leu Tyr Gly Leu Asn lie Asp 
^ 935 940 945 

CAA CTT TAT TAC ACG ACG GAT AAA GAC ATT CGC TAT ATT ACT GAT AAA 4020 
Gin Leu Tyr Tyr Thr Thr Asp Lys Asp lie Arg Tyr lie Thr Asp Lys 
950 955 960 

GTG AAT AAT ATG TAT CAA ACA ACC GTA GAA CTT GCC TAC CGT TCT TTA 4068 
Val Asn Asn Met Tyr Gin Thr Thr Val Glu Leu Ala Tyr Arg Ser Leu 
965 970 975 

CTT TTA CAA ACG CGT TTG AAG AAA TAT GTT TAT AGC GTT AAT GCG AAA 4116 
Leu Leu Oln Thr Arg Leu Lys Lys Tyr Val Tyr Ser Val Asn Ala Lys 
980 985 990 995 

CAA TTC GAA GGG AAA TGQ GTA ACC GAT TAT TCT CGT ACT GAA GCC TTA 4164 
Gin Phe Glu Gly Lys Trp Val Thr Asp Tyr Ser Arg Thr Glu Ala Leu 
1000 1005 1010 

TTT AAC TCT ACT TTT AAA CAA TCG CCT GAA AAT GCA TTA TAT GAT TTA 4212 
Phe Asn Ser Thr Phe Lys Gin Ser Pro Glu Asn Ala Leu Tyr Asp Leu 
1015 1020 1025 

AGC GAA TAC CTT TCT TTC TTT AAC GAT CCT ACG GAA TGG AAA GAA GGG 4260 
Ser Glu Tyr Leu Ser Phe Phe Asn Asp Pro Thr Glu Trp Lys Glu Gly 
1030 1035 1040 

CTA TTA CTG TTA AGC CGT TAT ATA GAT TAT GCT AAA GCA CAA GGA TTT 4308 
Leu Leu Leu Leu Ser Arg Tyr lie Asp Tyr Ala Lys Ala Gin Gly Phe 

1045 1050 1055 

TAT GAA AAC TGG GCG GCT ACT TCT AAC TTA ACT ATT GCC CGT TTA AGA 4356 
Tyr Glu Asn Trp Ala Ala Thr Ser Asn Leu Thr He Ala Arg Leu Arg 
1060 1065 1070 1075 

GAG GCT GGA GTA ATT TTT GCA GAA TCG ACG GAT TTA AAA GGC GAT GAA 4404 
Glu Ala Gly Val He Phe Ala Glu Ser Thr Asp Leu Lys Gly Asp Glu 
1080 1085 1090 

AAA AAT AAT ATT TTG TTA GGT AGC CAA AAA GAT AAT AAC TTA TCG GGT 4452 
Lys Asn Asn He Leu Leu Gly Ser Gin Lys Asp Asn Asn Leu Ser Gly 
1095 1100 1105 

AGT GCA GGT GAT GAT CTA CTT ATC GGC GGA GAG GGT AAT GAT ACG TTA 4500 
Ser Ala Gly Asp Asp Leu Leu He Gly Gly Glu Gly Asn Asp Thr Leu 
1110 1115 1120 

AAA GGC AGC TAC GGT GCA GAC ACC TAT ATC TTT AGC AAA GGA CAC GGA 4548 
Lys Gly Ser Tyr Gly Ala Asp Thr Tyr He Phe Ser Lys Gly His Gly 
^ 1125 1130 1135 

CAG GAT ATC GTT TAT GAA GAT ACC AAT AAT GAT AAC CGC GCA AGA GAT 4596 
Gin Asp He Val Tyr Glu Asp Thr Asn Asn Asp Asn Arg Ala Arg Asp 
1140 1145 1150 1155 

ATC GAC ACC TTA AAA TTT ACC GAT GTG AAT TAT GCG GAA GTG AAG TTT 4644 
He Asp Thr Leu Lys Phe Thr Asp Val Asn Tyr Ala Glu Val Lys Phe 
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Ara ArcT Val Asp Asn Asp Leu Met Leu Phe Oly Tyr His Asp Thr Asp 
^ ^ 1175 1180 1185 

TCG GTC ACG GTA AAA TCC TTC TAC AGC CAT GTA GAT TAT CAA TTT GAC 
Ser Val Thr Val Lys Ser Phe Tyr Ser His. Val Asp Tyr Gin Phe Asp 
1190 1195 1200 

AAA TTG GAG TTT GCT GAC CGC AGT ATA ACT CGC GAT GAA CTG ATT AAA 
Lvs Leu Glu Phe Ala Asp Arg Ser lie Thr Arg Asp Glu Leu lie Lys 
' 1205 1210 1215 

OCA GGG CTT CAT CTA TAC GGC ACC GAT GGC AAT GAT GAT ATA AAG GAT 
Ala Gly Leu His Leu Tyr Gly Thr Asp Gly Asn Asp Asp He Lys Asp 
1220 1225 1230 1235 

CAT GCG GAT TGG GAC AGC ATT TTG GAA GGC GGC AAA GGC AAC GAT ATT 
His Ala Asp Trp Asp Ser He Leu Glu Gly Gly Lys Gly Asn Asp lie 
1240 1245 1250 

CTA AGA GGT GGC TAC GOT GCG GAC ACC TAT ATC TTT AGC AAA GGA CAC 
Leu Arg Gly Gly TVr Gly Ala Asp Thr Tyr He Phe Ser Lys Gly His 
1255 1260 1265 

GGA CAG GAT ATC GTT TAT GAA GAT ACC AAT AAT GAT AAC CGC GCA AGA 
Gly Gin Asp lie Val Tyr Glu Asp Thr Asn Asn Asp Asn Arg Ala Arg 
' 1270 1275 1280 

GAT ATC GAC ACC TTA A7VA TTT ACT GAT GTG AAT TAT GCG GAA GTG AAA 
AsD He Asp Thr Leu Lys Phe Thr Asp Val Asn Tyr Ala Glu Val Lys 
^ 1285 1290 1295 

TTC CGA -CGA GTA GAT AAT GAC TTA ATG TTA TTC GGT TAT CAT GAT ACG 
Phe Arg Arg Val Asp Asn Asp Leu Met Leu Phe Gly Tyr His Asp Thr 
1300 1305 1310 1315 

GAT TCG GTC ACG ATA AAA TCC TTC TAC AAC CAT GTA GAT TAT C7^ TTT 
Asp Ser Val Thr He Lys Ser Phe Tyr Asn His Val Asp Tyr Gin Phe 
^ 1320 1325 1330 

GAC AAA TTG GAA TTT GCT GAC CGC AGT ATA ACT CGT GAT GAA CTA GOT 
Asp Lys Leu Glu Phe Ala Asp Arg Ser He Thr Arg Asp Glu Leu Gly 
1335 1340 1345 

AAA CAA GGT ATG GCA TTA TTT GGC ACT GAC GGT GAT GAT AAT ATC AAC 
Lys Gin Gly Met Ala Leu Phe Gly Thr Asp Gly Asp Asp Asn He Asn 
1350 1355 1360 

GAC TGG GGA CGT AAC TCG GTG ATT GAT GCC GGT GCG GGT AAT GAT ACG 
Asp Trp Gly Arg Asn Ser Val He Asp Ala Gly Ala Gly Asn Asp Thr 
^ 1365 1370 1375 

GTT AAT GGC GGT TAT GGC GAT GAC ACC CTC ATC GGC GGC AAA GGT AAT 
Val Asn Gly Gly Asn Gly Asp Asp Thr Leu He Gly Gly Lys Gly Asn 
1380 1385 1390 1395 

GAT ATT CTA AGA GGT GGC TAC GGT GCG GAC ACC TAT ATC TTT AGC AAA 
Asp He Leu Arg Gly Gly Tyr Gly Ala Asp Thr Tyr He Phe Ser Lys 
1400 1405 1410 

GGA CAC GGA CAG GAT ATC GTT TAT GAA GAT ACC AAT AAT GAT AAC CGC 
Gly His Gly Gin Asp He Val Tyr Glu Asp Thr Asn Asn Asp Asn Arg 
1415 1420 1425 

GCA AGA GAT ATC GAC ACC TTA AAA TTT ACC GAT GTG AAT TAT GCG GAA 
Ala Arg Asp He Asp Thr Leu Lys Phe Thr Asp Val Asn Tyr Ala Glu 
1430 1435 1440 

GTG AAA TTC CGA CQA GTA GAT AAT GAC TTA ATG TTA TTC GOT TAT CAT 
Val Lys Phe Arg Arg Val Asp Asn Asp Leu Met Leu Phe Gly Tyr His 
1445 1450 1455 

GAT ACG GAT TCG GTC ACG GTA AAA TCC TTC TAC AGC CAT GTA GAT TAT' 
Asp Thr Asp Ser Val Thr Val Lys Ser Phe Tyr Ser His Val Asp Tyr 
1460 1465 1470 1475 

CAA TTT GAC AAA TTG GAG TTT GCT GAC CGC AGT ATA ACT CGC GAT GAA 
Gin Phe Asp Lys Leu Glu Phe Ala Asp Arg Ser He Thr Arg Asp Glu 
1480 1485 1490 

CTG ATT AAA GCA GGG CTT CAT CTA TAC GGC ACC GAT GGC AAT GAT GAT 
Leu He Lys Ala Gly Leu His Leu Tyr Gly Thr Asp Gly Asn Asp Asp 
1495 1500 1505 
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1510 1515 1520 

AAC GAT ATT CTA AGA GGT GGC TAC GGT GCG GAC ACC TAT ATC TTT AGC 5748 
Asn Asp He Leu Tlrg Gly Gly Tyr Gly Ala Asp Thr Tyr lie Phe Ser 
1525 1530 1535 

AAA GGA CAC GGA CAG GAT ATC GTT TAT GAA GAT ACC AAT AAT GAT AAC 5796 
Lva Glv His Gly Gin Asp He Val Tyr Glu Asp Thr Asn Asn Asp Asn 
l|40 1545 1550 1555 

CGA OCA AGA GAT ATC GAC ACC TTA AAA TTT ACT GAT GTG AAT TAT GCG 5844 
Arq Ala Arg Asp He Asp Thr Leu Lys Phe Thr Asp Val Asn Tyr Ala 
^ 1560 1565 1570 

GAA GTG AAA TTC CGA CGA GTA GAT AAT GAC TTA ATG TTA TTC GGT TAT 5892 
Glu Val Lys Phe Arg Arg Val Asp Asn Asp Leu Met Leu Phe Gly Tyr 
1575 1580 1585 

CAT GAT ACG GAT TCG GTC ACG ATA AAA TCC TTC TAC AAC CAT GTA GAT 5940 
His Asp Thr Asp Ser Val Thr He Lys Ser Phe Tyr Asn His Val Asp 
1590 1595 1600 

TAT CAA TTT GAC AAA TTG GAA TTT GCT GAC CGC AGT ATA ACT CGT GAT 5988 
Tvr Gin Phe Asp Lys Leu Glu Phe Ala Asp Arg Ser He Thr Arg Asp 
^ 1605 1610 1615 

GAA CTA GGT AAA CAA GGT ATG GCA TTA TTT GGC ACT GAC GGT GAT GAT 6036 
Glu Leu Gly Lys Gin Gly Met Ala Leu Phe Gly Thr Asp Gly Asp Asp 
1620 1625 1630 1635 

AAT ATC AAC GAC TGG GGA CGT AAC TCG GTG ATT GAT GCC GGT GCG GGT 6084 
Asn Ile -Asn Asp Trp Gly Arg TVsn Ser Val He Asp Ala Gly Ala Gly 
1640 1645 1650 

AAT GAT ACG GTT AAT GGC GGT AAT GGC GAT GAC ACC CTC ATC GGC GGC 6132 
Asn Asp Thr Val Asn Gly Gly Asn Gly Asp Asp Thr Leu He Gly Gly 
1655 1660 1665 

AAA GGT AAT GAT ATT CTA AGA GGT GGC TAC GGT GCG GAC ACC TAT ATC 6180 
Lys Gly Asn Asp He Leu Arg Gly Gly Tyr Gly Ala Asp Thr Tyr He 
' 1670 1675 1680 

TTT AGC AAA GGA CAC GGA CAG GAT ATC GTT TAT GAA GAT ACC AAT AAT 6228 
Phe Ser Lys Gly His Gly Gin Asp He Val Tyr Glu Asp Thr Asn Asn 
1685 1690 1695 

GAT AAC CGC GCA AGA GAT ATC GAC ACC TTA AAA TTT ACT GAT ATT AAT 6276 
Asp Asn Arg Ala Arg Asp He Asp Thr Leu Lys Phe Thr Asp He Asn 
1700 1705 1710 1715 

TTA TCC GAA CTT TGG TTT AGC CGA GAA AAT AAC GAT TTG ATT ATT AAA 6324 
Leu Ser Glu Leu Trp Phe Ser Arg Glu Asn Asn T^p Leu He He Lys 
1720 1725 1730 

TCA TTA TTA AGT GAG GAT AAA GTC ACG GTT CAA AAT TGG TAT TCA CAC 6372 
Ser Leu Leu Ser Glu Asp Lys Val Thr Val Gin Asn Trp Tyr Ser His 
1735 1740 1745 

CAA GAT CAT AAA ATA GAA AAT ATT CGT TTA TCG AAT GAG CAA ACG TTG 6420 
Gin Asp His Lys He Glu Asn He Arg Leu Ser Asn Glu Gin Thr Leu 
1750 1755 1760 

GTG AGC ACT CAG GTG GAG AAG ATG GTT GAG TCG ATG GCC GGC TTT GCT 6468 
Val Ser Thr Gin Val Glu Lys Met Val Glu Ser Met Ala Gly Phe Ala 
1765 1770 1775 

CAG AAG CAC GGA GGA GAG ATA TCT CTT GTG TCG CTT GAA GAG GTA AAA 6516 
Gin Lys His Gly Gly Glu He Ser Leu Val Ser Leu Glu Glu Val Lys 
1780 1785 1790 1795 

CAA TAT ATC AAT AGC TTA ACA GCT GCT TTA TAA CATACGAAAG AAATCGGCAC 6569 
Gin Tyr He Asn Ser Leu Thr Ala Ala Leu ♦ 
1800 1605 

AGTTTTTTTG AACTGTGCCG ATTTGATTTT AGTGTAAGAA TATAGCCTGA TTTTAAGAAA 6629 

TTTACTCTTG GCTAATAACT ATTTCCCATT TTATAAGTTA TTGACGGATG GTTTTATCAA 6689 

ATATGAGATC AAATCTTATT TTAAATTCGC TTTCCATTAA GCGATAT 6736 



(2) INFORMATION FOR SBQ ID NO: 6: 
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(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULB TYPE: protein 

(xi) SEQUBNCB DESCRIPTION: SEQ ID NO: 6: 



lie Asp 
1 


Met 


Pro 


Pro Gly Thr 

5 


Gly Asp He Gin Leu Thr Leu Ser 

10 15 


Gin 


lie 


Pro 


Val 
20 


Thr Gly Ala 


Val Val Val Thr Thr Pro Gin Asp 
25 30 


Ala 


Leu 


lieu 
35 


Asp 


Ala Val Lys 


Gly He Ser Met Phe Gin Lys Val 
40 45 


Val 


Pro 
50 


Val 


Leu 


Gly lie He 
55 


Glu Asn Met Ser Val His He Cys 
60 


Asn 
65 


Cys 


Glv 


His 


His Glu Asp 
70 


He Phe Glv Thr Glv Glv Ala Glu 
75 


Val 


Ala 


Lys 


Lys 


Tyr Gly Thr 
85 


Lys Val Leu Gly Gin Met Pro Leu 
30 95 


lie Arg 


Leu 


Arg 
100 


Gin Asp Leu 


Asp Ala Gly Thr Pro Thr Val Val 
105 110 


Ala 


Pro 


Glu 
115 


His 


Glu Thr Ser 


Arg Ala Tyr He Glu Leu Ala Ala 
120 125 


Val 


Ala 
130- 


Ser 


Glu 


Leu Tyr Trp 
135 


Gin Gly Ser Val He Pro Ser Glu 
140 


Met 
145 


Ile 


Arg 


Glu 


Val Lys * 
150 





(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1806 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

Met Thr Lys Leu Thr Met Gin Asp Val Thr Asn Leu Tyr Leu Tyr Lys 
1 5 10 15 

Thr Lys Thr Leu Pro Lys Asp Arg Leu Asp Asp Ser Leu He Ser Glu 
20 25 30 

He Gly Lys Gly Asp Asp Asp He Asp Arg Lys Glu Phe Met Val Gly 
35 40 45 

Pro Gly Arg Phe Val Thr Ala Asp Asn Phe Ser Val Val Arg Asp Phe 
50 55 60 

Phe Asn Ala Gly Lys Ser Arg He He Ala Pro Gin Val Pro Pro He 
65 70 75 80 

Arg Ser Gin Gin Glu Lys He Leu Val Gly Leu Lys Pro Gly Lys Tyr 
85 90 ^ 

Ser Lys Ala Gin He Leu Glu Met Leu Gly Tyr Thr Lys Gly Gly Glu 
100 105 110 



Val 



Val Asn Gly Met Phe Ala Gly Glu Val Gin Thr Leu Gly Phe Tyr 
115 120 125 



Asp Asp Gly Lys Gly Asp Leu Leu Glu Arg Ala Tyr He Trp Asn Thr 
130 135 140 

Thr Gly Phe Lys Met Ser Asp Asn Ala Phe Phe Val He Glu Glu Ser 
145 150 155 160 

Gly Lys Arg Tyr He Glu Asn Phe Gly He Glu Pro Leu Gly Lys Gin 
165 170 175 

Glu Asp Phe Asp Phe Val Gly Gly Phe Trp Ser Asn Leu Val Asn Arq 
180 185 190 
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Leu Asn Phe Thr Gly Glu Val Glu Thr Tyr Thr Leu Asp Glu Thr Arg 

210 215 220 

Phe Lys Ala Glu Ala Ala Lys Lys Ser His Trp Ser Leu Val Asn Ala 
225 230 235 240 

Ala Lys Val Tyr Gly Gly Leu Asp Gin lie He Lys Lys Leu Trp Asp 
245 250 255 

Ser Gly Ser He Lys His Leu Tyr Gin Asp Lys Asp Thr Gly Lys Leu 

260 265 270 

Lys Pro He He Tyr Gly Thr Ala Gly Asn Asp Ser Lys He Glu Gly 
275 280 285 

Thr Lys He Thr Arg Arg He Ala Gly Lys Glu Val Thr Leu Asp He 
290 295 300 

Ala Asn Gin Lys He Glu Lys Gly Val Leu Glu Lys Leu Gly Leu Ser 
305 310 315 320 

Val Ser Gly Ser Asp He He Lys Leu Leu Phe Gly Ala Leu Thr Pro 
325 330 335 

Thr Leu Asn Arg Met Leu Leu Ser Gin Leu He Gin Ser Phe Ser Asp 
340 345 350 

Ser Leu Ala Lys Leu Asp Asn Pro Leu Ala Pro Tyr Thr Lys Asn Gly 

355 ' 360 365 

Val Val Tyr Val Thr Gly Lys Gly Asn Asp Val Leu Lys Gly Thr Glu 
370- 375 380 

His Glu Asp Leu Phe Leu Gly Gly Glu Gly Asn Asp Thr Tyr Tyr Ala 
385 390 395 400 

Arc Val Gly Asp Thr He Glu Asp Ala Asp Gly Lys Gly Lys Val Tyr 
405 410 415 

Phe Val Arg Glu Lys Gly Val Pro Lys Ala Asp Pro Lys Arg Val Glu 
420 425 430 

Phe Ser Glu Tyr He Thr Lys Glu Glu He Lys Glu Val Glu Lys Gly 
435 440 445 

Leu Leu Thr Tyr Ala Val Leu Glu Asn Tyr Asn Trp Glu Glu Lys Thr 
450 455 460 

Ala Thr Phe Ala His Ala Thr Met Leu Asn Glu Leu Phe Thr Asp Tyr 
465 470 475 480 

Thr Asn Tyr Arg Tyr Glu Val Lys Gly Leu Lys Leu Pro Ala Val Lys 
485 490 495 

Lys Leu Lys Ser Pro Leu Val Glu Phe Thr Ala Asp Leu Leu Thr Val 
500 505 510 

Thr Pro He Asp Glu Asn Gly Lys Ala Leu Ser Glu Lys Ser He Thr 
515 520 525 

Val Lys Asn Phe Lys Asn Gly Asp Leu Gly He Arg Leu Leu Asp Pro 
530 535 540 

Asn Ser Tyr Tyr Tyr Phe Leu Glu Gly Gin Asp Thr Gly Phe Tyr Gly 
545 550 555 560 

Pro Ala Phe Tyr He Glu Arg Lys Asn Gly Gly Gly Ala Lys Asn Asn 
565 570 575 

Ser Ser Gly Ala Gly Asn Ser Lys Asp Trp Gly Gly Asn Gly His Gly 
580 585 590 

Asn His Arg Asn Asn Ala Ser Asp Leu Asn Lys Pro Asp Gly Asn Asn 
595 600 605 

Gly Asn Asn Gin Asn Asn Gly Ser Asn Gin Asp Asn His Ser Asp Val 
610 615 620 

Asn Ala Pro Asn Asn Pro Gly Arg Asn Tyr Asp He Tyr Asp Pro Leu 
625 630 635 640 

Ala Leu Asp Leu Asp Gly Asp Gly Leu Glu Thr Val Ser Met Asn Gly 
645 650 655 
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660 665 670 

Thr Gly Trp Leu Ala Ala Asp Asp Gly Phe Leu Val Leu Asp Arg Asn 
675 680 685 

Gin Asp Gly He He Asn Asp He Ser Glu Leu Phe Ser Asn Lys Asn 
690 695 700 

Gin Leu Ser Asp Gly Ser He Ser Ala His Gly Phe Ala Thr Leu Ala 
705 710 71S 720 

Asp Leu Asp Thr Asn Gin Asp Gin Arg He Asp Gin Asn Asp Lys Leu 
725 730 735 

Phe Ser Lys Leu Gin He Trp Arg Asp Leu Asn Gin Asn Gly Phe Ser 
740 745 750 

Glu Ala Asn Glu Leu Phe Ser Leu Glu Ser Leu Asn He Lys Ser Leu 
755 760 765 

His Thr Ala Tyr Glu Glu Arg Asn Asp Phe Leu Ala Gly Asn Asn lie 
770 775 780 

Leu Ala Gin Leu Gly Lys Tyr Glu Lys Thr Asp Gly Thr Phe Ala Gin 
785 790 795 800 

Met Gly Asp Leu Asn Phe Ser Phe Asn Pro Phe Tyr Ser Arg Phe Thr 
805 810 815 

Glu Ala Leu Asn Leu Thr Glu Gin Gin Arg Arg Thr He Asn Leu Thr 
820 825 830 

Gly Thr Gly Arg Val Arg Asp Leu Arg Glu Ala Ala Ala Leu Ser Glu 
835 840 845 

Glu Leu Ala Ala Leu Leu Gin Gin Tyr Thr Lys Ala Ser Asp Phe Gin 
850 855 860 

Ala Gin Arg Glu Leu Leu Pro Ala He Leu Asp Lys Trp Ala Ala Thr 
865 870 875 880 

Asp Leu Gin Tyr Gin His Tyr Asp Lys Thr Leu Leu Lys Thr Val Glu 
885 890 895 

Ser Thr Asp Ser Ser Ala Ser Val Val Arg Val Thr Pro Ser Gin Leu 

900 905 910 

Ser Ser He Arg Asn Ala Lys His Asp Pro Thr Val Met Gin Asn Phe 
915 920 925 

Glu Gin Ser Lys Ala Lys He Ala Thr Leu Asn Ser Leu Tyr Gly Leu 
930 935 940 

Asn He Asp Gin Leu Tyr Tyr Thr Thr Asp Lys Asp He Arg Tyr He 
945 950 955 ^ ^ 960 

Thr Asp Lys Val Asn Asn Met Tyr Gin Thr Thr Val Glu Leu Ala Tyr 
965 970 975 

Arg Ser Leu Leu Leu Gin Thr Arg Leu Lys Lys Tyr Val Tyr Ser Val 
980 985 990 

Asn Ala Lys Gin Phe Glu Gly Lys Trp Val Thr Asp Tyr Ser Arq Thr 
995 1000 1005 

Glu Ala Leu Phe Asn Ser Thr Phe Lys Gin Ser Pro Glu Asn Ala Leu 
1010 1015 1020 

Tyr Asp Leu Ser Glu Tyr Leu Ser Phe Phe Asn Asp Pro Thr Glu Trp 
1025 1030 1035 1040 

Lys Glu Gly Leu Leu Leu Leu Ser Arg Tyr He Asp Tyr Ala Lys Ala 
1045 1050 1055 

Gin Gly Phe Tyr Glu Asn Trp Ala Ala Thr Ser Asn Leu Thr He Ala 
1060 1065 1070 

Arg Leu Arg Glu Ala Gly Val He Phe Ala Glu Ser Thr Asp Leu Lys 
1075 1080 1085 

Gly Asp Glu Lys Asn Asn He Leu Leu Gly Ser Gin Lys Asp Asn Asn 
1090 1095 1100 
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Asp Thr Leu Lys Gly Ser Tyr Gly Ala Asp Thr Tyr lie Phe Ser Lys 

1125 1130 1135 

Gly His Gly Gin Asp lie Val Tyr Glu Asp Thr Asn Asn Asp Asn Arg 
1140 1145 1150 

Ala Arg Asp lie Asp Thr Leu Lys Phe Thr Asp Val Asn Tyr Ala Glu 
Ills 1160 1165 

Val Lys Phe Arg Arg Val Asp Asn Asp Leu Met Leu Phe Gly Tyr His 
1170 1175 1180 

Asp Thr Asp Ser Val Thr Val Lys Ser Phe Tyr Ser His Val Asp Tyr 
I1I5 1190 1195 1200 

Gin Phe Asp Lys Leu Glu Phe Ala Asp Arg Ser lie Thr Arg Asp Glu 
1205 1210 1215 

Leu lie Lys Ala Gly Leu His Leu Tyr Gly Thr Asp Gly Asn Asp Asp 
1220 1225 1230 

He Lys Asp His Ala Asp Trp Asp Ser He Leu Glu Gly Gly Lys Gly 
1235 1240 1245 

Asn Asp lie Leu Arg Gly Gly Tyr Gly Ala Asp Thr Tyr He Phe Ser 
1250 1255 1260 

Lys Gly His Gly Gin Asp He Val Tyr Glu Asp Thr Asn Asn Asp Asn 
1265 1270 1275 1280 

Arq Ala Arg Asp He Asp Thr Leu Lys Phe Thr Asp Val Asn Tyr Ala 
1285 1290 1295 

Glu Val Lys Phe Arg Arg Val Asp Asn Asp Leu Met Leu Phe Gly Tyr 
1300 1305 1310 

His Asp Thr Asp Ser Val Thr He Lys Ser Phe Tyr Asn His Val Asp 
1315 1320 1325 

Tyr Gin Phe Asp Lys Leu Glu Phe Ala Asp Arg Ser He Thr Arg Asp 
1330 1335 1340 

Glu Leu Gly Lys Gin Gly Met Ala Leu Phe Gly Thr Asp Gly Asp Asp 
1345 1350 1355 1360 

Asn He Asn Asp Trp Gly Arg Asn Ser Val He Asp Ala Gly Ala Gly 
1365 1370 1375 

Asn Asp Thr Val Asn Gly Gly Asn Gly Asp Asp Thr Leu He Gly Gly 
1380 1385 1390 

Lys Gly Asn Asp He Leu Arg Gly Gly Tyr Gly Ala Asp Thr Tyr He 
1395 1400 1405 

Phe Ser Lys Gly His Gly Gin Asp He Val Tyr Glu Asp Thr Asn Asn 
1410 1415 1420 

Asp Asn Arg Ala Arg Asp He Asp Thr Leu Lys Phe Thr Asp Val Asn 
1425 1430 1435 1440 

Tyr Ala Glu Val Lys Phe Arg Arg Val Asp Asn Asp Leu Met Leu Phe 
1445 1450 1455 

Gly Tyr His Asp Thr Asp Ser Val Thr Val Lys Ser Phe Tyr Ser His 
1460 1465 1470 

Val Asp Tyr Gin Phe Asp Lys Leu Glu Phe Ala Asp Arg Ser He Thr 
1475 1480 1485 

Arg Asp Glu Leu He Lys Ala Gly Leu His Leu Tyr Gly Thr Asp Gly 
1490 1495 1500 

Asn Asp Asp He Lys Asp His Ala Asp Trp Asp Ser He Leu Glu Gly 
1505 1510 1515 1520 

Gly Lys Gly Asn Asp He Leu Arg Gly Gly Tyr Gly Ala Asp Thr Tyr 
1525 1530 1535 

He Phe Ser Lys Gly His Gly Gin Asp He Val Tyr Glu Asp Thr Asn 
1540 1545 1550 

Asn Asp Asn Arg Ala Arg Asp He Asp Thr Leu Lys Phe Thr Asp Val 

1 CCC 1 crn r^f 



60 

1570 1575 15B0 

Phe Gly Tyr His Asp Thr Asp Ser Val Thr He Lys Ser Phe Tyr Asn 
1585 1590 1595 1600 

His Val Asp Tyr Gin Phe Asp Lys Leu Glu Phe Ala Asp Arg Ser He 
1605 1610 1615 

Thr Arg Asp Glu Leu Gly Lys Gin Gly Met Ala Leu Phe Gly Thr Asp 
1620 1625 1630 

Gly Asp Asp Asn He Asn Asp Trp Gly Arg Asn Ser Val He Asp Ala 
1635 1640 1645 

Gly Ala Gly Asn Asp Thr Val Asn Gly Gly Asn Gly Asp Asp Thr Leu 
1650 1655 1660 

He Gly Gly Lys Gly Asn Asp He Leu Arg Gly Gly Tyr Gly Ala Asp 
1665 1670 1675 1680 

Thr Tyr He Phe Ser Lys Gly His Gly Gin Asp He Val Tyr Glu Asp 
' 1685 1690 1695 

Thr Asn Asn Asp Asn Arg Ala Arg Asp He Asp Thr Leu Lys Phe Thr 
1700 1705 1710 

Asp He Asn Leu Ser Glu Leu Trp Phe Ser Arg Glu Asn Asn Asp Leu 
1715 1720 1725 

He He Lys Ser Leu Leu Ser Glu Asp Lys Val Thr Val Gin Asn Trp 
1730 1735 1740 

Tyr Ser His Gin Asp His Lys He Glu Asn He Arg Leu Ser Asn Glu 
1745 1750 1755 1760 

Gin Thr Leu Val Ser Thr Gin Val Glu Lys Met Val Glu Ser Met Ala 

1765 1770 1775 

Gly Phe Ala Gin Lys His Gly Gly Glu He Ser Leu Val Ser Leu Glu 
1780 1785 1790 

Glu Val Lys Gin Tyr He Asn Ser Leu Thr Ala Ala Leu * 
1795 1800 1805 



